Highly phosphorylated acid beta-glucocerebrosidase and methods of treating gaucher&#39;s disease

ABSTRACT

The present invention provides a highly phosphorylated acid beta-glucocerebrosidase (GBA), which can be employed in an enzyme replacement therapy protocol to treat patients suffering from Gaucher&#39;s disease.

BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention

[0002] The present invention is directed to a highly phosphorylated acid beta-glucocerebrosidase (GBA), which can be employed in an enzyme replacement therapy protocol to treat patients suffering from Gaucher's disease.

[0003] 2. Discussion of the Background

[0004] Gaucher's disease is a lysosomal storage disease believed to be caused by a deficiency of acid β-glucocerbrosidase (GBA)(Friedman, B., et al (1999) Blood, 93, 2807-2816). Most lysosomal enzymes are targeted to the lysosome by the mannose 6-phosphate (M6P) dependent pathway. In order for lysosomal enzymes to be targeted to the lysosome they must first acquire, through post-translational modification the M6P residues essential for targeting. GBA is an exception in that it is not believed to be naturally targeted through the M6P pathway or is it completely understood how it is targeted.

[0005] These post-translational modifications may be carried out by the sequential action of two enzymes: N-acetylglucosaminylphosphotransferase (GlcNAc-phosphotransferase) and N-acetylglucosamine-1-phosphodiester α-N-acetylglucosaminidase (Uncovering enzyme; UCE) (Varki, A. P., et al (1981) Proc. Natl. Acad. Sci. USA., 78, 7773-7777).

[0006] GlcNAc-phosphotransferase catalyzes the transfer of N-acetylglucosamine-1-phosphate from UDP-GlcNAc to the 6 position of 1,2-linked mannoses on the lysosomal enzyme. The recognition and addition of N-acetylgluocosamine-1-phosphate to lysosomal hydrolases by GlcNAc-phosphotransferase is the critical and determining step in lysosomal targeting. The second step is catalyzed by N-acetylglucosamine-1-phosphodiester α-N-Acetylglucosaminidase (“phosphodiester α-GlcNAcase”) (E.C. 3.1.4.45). Phosphodiester α-GlcNAcase catalyzes the removal of N-Acetylglucosamine from the GlcNAc-phosphate modified lysosomal enzyme to generate a terminal M6P on the lysosomal enzyme.

[0007] Both enzymes responsible for the terminal M6P on the lysosomal enzymes have been previously isolated and characterized (U.S. Pat. Ser. Nos. 09/636,077, 09/636,596, 09/635,872 and 09/636,060, incorporated herein by reference). During normal cellular processes, GBA is not typically a substrate for the GlcNAc phosphotransferase/phosphodiester α-GlcNAcase modification pathway.

[0008] Currently, the GBA used in enzyme replacement therapy is modified so that it contains terminal mannose moieties (2 GlcNAc and 3 mannose) that facilitate GBA targeting to tissues via the high affinity mannose receptor located on the surface of some macrophages (Friedman et (1999) Blood:93(9):2807-2816). A problem that exist with the current GBA enzyme replacement therapy is that affected tissues such as bone and lung in which the enzyme is unable to reach because these tissues do not contain the proper macrophages to allow efficient targeting (Beutler, E. et al (1995) Mol Med, 1, 320-324). Thus, while the tissues that are targeted by the current GBA, e.g., liver and spleen, receive some benefit from the replacement therapy, the tissues that are not targeted, e.g., bone and lung, suffer from long-term deficiencies such as pulmonary hypertension and progressive bone disease (Gaucher's bone disease) (see, for example, Beutler et al (1995) Mol. Med., 1:320-324).

[0009] To address these problems with current GBA replacement therapy, GBA will be phosphorylated which will allow binding to mannose 6 receptors on the surface of lung and bone cells. In so binding to the receptor on these tissues the problems of the current GBA replacement therapy can be addressed. Therefore, the highly phosphorylated GBA when employed in therapeutic protocols will increase the amount of GBA in the targeted bone and lung tissues resulting in improvements for the long-term prospects of Gaucher's patients. (Friedman et al (1999) Blood:93(9):2807-2816).

[0010] The present inventors have discovered that GlcNAc phosphotransferase, comprising the α and β subunits reduces substrate specificity, which allows the GlcNAc phosphotransferase to catalyze the transfer of N-acetylglucosamine-1-phosphate from UDP-GlcNAc to the GBA enzyme. This modified GBA may then be treated with phosphodiester α-GlcNAcase to complete the modification of the GBA thereby making the enzyme available for targeting tissues via the M6P receptor.

[0011] This modified enzyme is found to bind to the mannose 6-phosphate receptor with high affinity resulting in an increased bioavailablity of the enzyme to mannose 6-phosphate bearing cells when compared to the current GBA employed in therapeutic protocols.

SUMMARY OF THE INVENTION

[0012] Accordingly, one object of the present invention is a method of preparing a highly phosphorylated acid β-glucocerbrosidase comprising contacting said acid β-glucocerbrosidase with an isolated GlcNAc phosphotransferase to produce a modified acid β-glucocerbrosidase; and contacting said modified acid β-glucocerbrosidase with an isolated phosphodiester α-GlcNAcase. In a preferred embodiment the highly phosphorylated acid β-glucocerbrosidase is purified after modification with the isolated GlcNAc phosphotransferase or after contacting with the isolated phosphodiester α-GlcNAcase.

[0013] Another object of the present invention is a method of preparing the highly phosphorylated GBA by culturing transfected cells comprising a recombinant polynucleotide which encodes a recombinant acid β-glucocerbrosidase in the presence of at least one α 1,2-mannosidase inhibitor; recovering a high mannose recombinant acid β-glucocerbrosidase from said transfected cell; contacting said high mannose recombinant acid β-glucocerbrosidase with an isolated GlcNAc phosphotransferase to produce a modified acid β-glucocerbrosidase; and contacting said modified acid β-glucocerbrosidase with an isolated phosphodiester α-GlcNAcase.

[0014] Another object of the present invention is a highly phosphorylated acid β-glucocerbrosidase.

[0015] Another object of the present invention pharmaceutical compositions that contain highly phosphorylated acid β-glucocerbrosidase with a pharmaceutically acceptable carrier.

[0016] Another object of the present invention is a method of treating a patient suffering from Gaucher's disease by administering the highly phosphorylated acid β-glucocerbrosidase.

[0017] In one embodiment, the method entails administration to bone and/or lung tissue of the patient. In another embodiment, the highly phosphorylated acid β-glucocerbrosidase is administered with a acid β-glucocerbrosidase which is not highly phosphorylated.

BRIEF DESCRIPTION OF THE DRAWING

[0018]FIG. 1: Mannose-6-Phosphate column comparison of (A) wildtype GBA; (B) highly phosphorylated GBA, phosphorylated with a mixture of α/β/γ GlcNAc-phosphotransferase and α/β GlcNAc-phosphotransferase; and (C) highly phosphorylated GBA, phosphorylated with α/β GlcNAc-phosphotransferase.

DETAILED DESCRIPTION OF THE INVENTION

[0019] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art of molecular biology. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described herein. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In addition, the materials, methods, and examples are illustrative only and are not intended to be limiting.

[0020] Reference is made to standard textbooks of molecular biology that contain definitions and methods and means for carrying out basic techniques, encompassed by the present invention. See, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, New York (2001), Current Protocols in Molecular Biology, Ausebel et al (eds.), John Wiley & Sons, New York (2001) and the various references cited therein.

[0021] “Isolated” means separated out of its natural environment.

[0022] “Polynucleotide” in general relates to polyribonucleotides and polydeoxyribonucleotides, it being possible for these to be non-modified RNA or DNA or modified RNA or DNA.

[0023] “Polypeptides” are understood as meaning peptides or proteins which comprise two or more amino acids bonded via peptide bonds.

[0024] The term “acid β-glucocerbrosidase” or “GBA” as used herein refers to enzymes that are involved in glycolipid degradation while in Gaucher deficiency in all cells and tissue, the predominant problem is in reticuloendothelial cells. GBA is also known in the art as acid β-glucosidase.

[0025] Polynucleotides which encode GBA as used herein is understood to mean the sequences exemplified in this application as well as those which have substantial identity to SEQ ID NO: 24 (shown below) and which encode an enzyme having GBA activity. The cDNA comprising SEQ ID NO:24 can produce type I and type II GBA. Initiation codons and stop codon are shown in bold. In one preferred embodiment for the polynucleotide sequence expression a Kozak sequence can be introduced upstream of the initiation codon for better expression and the 3′-UTR can be deleted. agctaaggca ggtacctgca tccttgtttt tgtttagtgg atcctctatc cttcagagac tctggaaccc ctgtggtctt ctcttcatct aatgaccctg aggggatgga gttttcaagt ccttccagag aggaatgtcc caagcctttg agtagggtaa gcatcatggc tggcagcctc acagqattgc ttctacttca ggcagtgtcg tgggcatcag gtgcccgccc ctgcatccct aaaagcttcg gctacagctc ggtggtgtgt gtctgcaatg ccacatactg tgactccttt gaccccccga cctttcctgc ccttggtacc ttcagccgct atgagagtac acgcagtggg cgacggatgg agctgagtat ggggcccatc caggctaatc acacgggcac aggcctgcta ctgaccctgc agccagaaca gaagttccag aaagtgaagg gatttggagg ggccatgaca gatgctgctg ctctcaacat ccttgccctg tcaccccctg cccaaaattt gctacttaaa tcgtacttct ctgaagaagg aatcggatat aacatcatcc gggtacccat ggccagctgt gacttctcca tccgcaccta cacctatgca gacacccctg atgatttcca gttgcacaac ttcagcctcc cagaggaaga taccaagctc aagatacccc tgattcaccg agccctgcag ttggcccagc gtcccgtttc actccttgcc agcccctgga catcacccac ttggctcaag accaatggag cggtgaatgg gaaggggtca ctcaagggac agcccggaga catctaccac cagacctggg ccagatactt tgtgaagttc ctggatgcct atgctgagca caagttacag ttctgggcag tgacagctga aaatgagcct tctgctgggc tgttgagtgg ataccccttc cagtgcctgg gcttcacccc tgaacatcag cgagacttca ttgcccgtga cctaggtcct accctcgcca acagtactca ccacaatgtc cgcctactca tgctggatga ccaacgcttg ctgctgcccc actgggcaaa ggtggtactg acagacccag aagcagctaa atatgttcat ggcattgctg tacattggta cctggacttt ctggctccag ccaaagccac cctaggggag acacaccgcc tgttccccaa caccatgctc tttgcctcag aggcctgtgt gggctccaag ttctgggagc agagtgtgcg gctaggctcc tgggatcgag ggatgcagta cagccacagc atcatcacga acctcctgta ccatgtggtc ggctggaccg actggaacct tgccctgaac cccgaaggag gacccaattg ggtgcgtaac tttgtcgaca gtcccatcat tgtagacatc accaaggaca cgttttacaa acagcccatg ttctaccacc ttggccactt cagcaagttc attcctgagg gctcccagag agtggggctg gttgccagtc agaagaacga cctggacgca gtggcactga tgcatcccga tggctctgct gttgtggtcg tgctaaaccg ctcctctaag gatgtgcctc ttaccatcaa ggatcctgct gtgggcttcc tggagacaat ctcacctggc tactccattc acacctacct gtggcgtcgc cagtgatgga gcagatactc aaggaggcac tgggctcagc ctgggcatta aagggacaga gtcagctcac acgctgtctg tgactaaaga gggcacagca gggccagtgt gagcttacag cgacgtaagc ccaggggcaa tggtttgggt gactcacttt cccctctagg tggtgccagg ggctggaggc ccctagaaaa agatcagtaa gccccagtgt ccccccagcc cccatgctta tgtgaacatg cgctgtgtgc tgcttgcttt ggaaactggg cctgggtcca ggcctagggt gagctcactg tccgtacaaa cacaagatca gggctgaggg taaggaaaag aagagactag gaaagctggg cccaaaactg gagactgttt gtctttcctg gagatgcaga actgggcccg tggagcagca gtgtcagcat cagggcggaa gccttaaagc agcagcgggt gtgcccaggc acccagatga ttcctatggc accagccagg aaaaatggca gctcttaaag gagaaaatgt ttgagcccaa aaaaaaaaaa aaaaaaaaa

[0026] Preferably, polynucleotides that encode GBA are those which hybridize under stringent conditions and are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to SEQ ID NO:24. GBA polynucleotides as herein also include those nucleotide sequences found in public databases, for example, those listed below (the corresponding protein ID is shown in parentheses): 1. BC 003356 (AAH 03356.1) 2. D 13286 (BAA 02545) 3. M 19285 (AAA 35880) 4. M 16328 (AAA 35873) 5. K 02920 (AAA 35887) 6. BC 000349 7. J 03059 (AAC 63056) 8. BG 716343 (full length with mutation); and 9. BG 281198 (not full length)

[0027] The GBA protein or polypeptide as used herein is understood to mean the sequences exemplified in this application as well as those which have substantial identity to SEQ ID NO:25 and/or 26. Preferably, such polypeptides are those which are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to SEQ ID NO:25 and/or 26. For example, the precursor protein of acid beta glucosidase (GBA, also known as glucocerebrosidase), include four different types of precursor protein. The amino acid sequence of this protein is depicted below (SEQ ID NO:25): MEFSSPSREE CPKPLSRVSI MAGSLTGLLL LQAVSWASGA RPCIPKSFGY SSVVCVCNAT YCDSFDPPTF PALGTFSRYE STRSGRRMEL SMGPIQANHT GTGLLLTLQP EQKFQKVKGF GGAMTDAAAL NILALSPPAQ NLLLKSYFSE EGIGYNIIRV PMASCDFSIR TYTYADTPDD FQLHNFSLPE EDTKLKIPLI HRALQLAQRP VSLLASPWTS PTWLKTNGAV NGKGSLKGQP GDIYHQTWAR YFVKFLDAYA EHKLQFWAVT AENEPSAGLL SGYPFQCLGF TPEHQRDFIA RDLGPTLANS THHNVRLLML DDQRLLLPHW AKVVLTDPEA AKYVHGIAVH WYLDFLAPAK ATLGETHRLF PNTMLFASEA CVGSKFWEQS VRLGSWDRGM QYSHSIITNL LYHVVGWTDW NLALNPEGGP NWVRNFVDSP IIVDITKDTF YKQPMFYHLG HFSKFIPEGS QRVGLVASQK NDLDAVALMH PDGSAVVVVL NRSSKDVPLT IKDPAVGFLE TISPGYSIHT YLWRRQ* HRQ

[0028] From this amino acid sequence, the four different types include: Protein type I; N-terminal is Met 1 and C-terminal is RRQ. (SEQ ID NO:25) Protein type II; N-terminal is Met 21 and C-terminal is RRQ. Protein type III; N-terminal is Met 1 and C-terminal is HRQ. (SEQ ID NO:26) Protein type IV; N-terminal is Met 21 and C-terminal is HRQ.

[0029] In addition to the sequence depicted above, other suitable GBA sequences are known in the art as shown in the Table below, where several GBA proteins are identified by protein ID, reference number, or locus names. Type I Type II Type II Type IV CAD 12720 AAA 35877 NP 000148 CAD 12721 T 08828 1202301 A P 04062 AAA 35873 BAA 02545 EUHUGC AAC 63056 AAA 35880 AAC 51820 1112264A 2004300A AAH 03356.1

[0030] The term “GlcNAc-phosphotransferase” as used herein refers to enzymes that are capable of catalyzing the transfer of N-acetylglucosamine-1-phosphate from UDP-GlcNAc to the 6′ position of 1,2-linked mannoses on lysosomal enzymes. The GlcNAc-phosphotrasferase is composed of six subunits: 2 α subunits, 2 β-subunits and 2 γ subunits. The amino acid sequence of the a subunit is shown in SEQ ID NO:4 (amino acids 1-928), the human β subunit is shown in SEQ ID NO:5 (amino acids 1-328), and the human γ subunit is shown in SEQ ID NO:7 (amino acids 25-305, signal sequence is in amino acids 1-24).

[0031] A novel soluble GlcNAc phosphotransferase has been prepared which is composed of a non-endogenous proteolytic cleavage site interposed between the α and β subunits. When combined with the γ subunit, this GlcNAc phosphotransferase exhibits high levels of activity. The soluble GlcNAc-phosphotransferase protein or polypeptide as used herein is understood to mean the sequences exemplified in this application as well as those which have substantial identity to SEQ ID NO:2. The partial rat and Drosphila melanogaster α/β GlcNAc-phosphotransferase amino acid sequences are shown in SEQ ID NO: 14 and 16, respectively.

[0032] Preferably, the GlcNAc-phosphotransferase polypeptides are those which are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to the GlcNAc-phosphotransferase amino acid sequences described herein.

[0033] Polynucleotides which encode the α and β subunits of GlcNAc-phosphotransferase or soluble GlcNAc-phosphotransferase mean the sequences exemplified in this application as well as those which have substantial identity to those sequences and which encode an enzyme having the activity of the α and β subunits of GlcNAc-phosphotransferase. Preferably, such polynucleotides are those which hybridize under stringent conditions and are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to those sequences

[0034] The nucleotide sequence for the human α/β subunit precursor cDNA is shown in SEQ ID NO:3 (nucleotides 165-3932), the nucleotide sequence of the α subunit is in nucleotides 165-2948 of SEQ ID NO:3, the nucleotide sequence of the β subunit is shown in nucleotides 2949-3932 of SEQ ID NO:3, and the nucleotide sequence of the γ subunit is shown in SEQ ID NO:6 (nucleotides 24-95). The soluble GlcNAc-phosphotransferase nucleotide sequence is shown in SEQ ID NO:1. The partial rat and Drosphila melanogaster α/β GlcNAc-phosphotransferase nucleotide sequences are shown in SEQ ID NO: 13 and 15, respectively.

[0035] The term “phosphodiester α-GlcNAcase” as used herein refers to enzymes that are capable of catalyzing the removal of N-Acetylglucosamine from GlcNAc-phosphate-mannose diester modified lysosomal enzymes to generate terminal M6P.

[0036] Polynucleotides which encode phosphodiester α-GlcNAcase as used herein is understood to mean the sequences exemplified in this application as well as those which have substantial identity to SEQ ID NO:19 (murine) or SEQ ID NO:17 (human) and which encode an enzyme having the activity of phosphodiester α-GlcNAcase. Preferably, such polynucleotides are those which hybridize under stringent conditions and are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to SEQ ID NOS:17 and/or 19.

[0037] The phosphodiester α-GlcNAcase protein or polypeptide as used herein is understood to mean the sequences exemplified in this application as well as those which have substantial identity to SEQ ID NO:20 (murine) or SEQ ID NO:18 (human). Preferably, such polypeptides are those which are at least 70%, preferably at least 80% and more preferably at least 90% to 95% identical to SEQ ID NOS:18 and/or 20.

[0038] The terms “stringent conditions” or “stringent hybridization conditions” includes reference to conditions under which a polynucleotide will hybridize to its target sequence, to a detectably greater degree than other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences can be identified which are 100% complementary to the probe (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing).

[0039] Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37° C., and a wash in 1× to 2×SSC (20×SSC=3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55° C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.5× to 1×SSC at 55 to 60° C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1×SSC at 60 to 65° C.

[0040] Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the T_(m) can be approximated from the equation of Meinkoth and Wahl, Anal. Biochem., 138:267-284 (1984): T_(m)=81.5° C.+16.6 (log M)+0.41 (% GC)−0.61 (% form)−500/L; where M is the molarity of monovalent cations, % GC is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The T_(m) is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. T_(m) is reduced by about 1° C. for each 1% of mismatching; thus, T_(m), hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with approximately 90% identity are sought, the T_(m) can be decreased 10° C. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (T_(m)) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4° C. lower than the thermal melting point (T_(m)); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10° C. lower than the thermal melting point (T_(m)); low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20° C. lower than the thermal melting point (T_(m)). Using the equation, hybridization and wash compositions, and desired T_(m), those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a T_(m) of less than 45° C. (aqueous solution) or 32° C. (formamide solution) it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes, Part I, Chapter 2 “Overview of principles of hybridization and the strategy of nucleic acid probe assays”, Elsevier, N.Y. (1993); and Current Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995).

[0041] Homology, sequence similarity or sequence identity of nucleotide or amino acid sequences may be determined conventionally by using known software or computer programs such as the BestFit or Gap pairwise comparison programs (GCG Wisconsin Package, Genetics Computer Group, 575 Science Drive, Madison, Wis. 53711). BestFit uses the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 (1981), to find the best segment of identity or similarity between two sequences. Gap performs global alignments: all of one sequence with all of another similar sequence using the method of Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970). When using a sequence alignment program such as BestFit, to determine the degree of sequence homology, similarity or identity, the default setting may be used, or an appropriate scoring matrix may be selected to optimize identity, similarity or homology scores. Similarly, when using a program such as BestFit to determine sequence identity, similarity or homology between two different amino acid sequences, the default settings may be used, or an appropriate scoring matrix, such as blosum45 or blosum80, may be selected to optimize identity, similarity or homology scores.

[0042] The high-affinity ligand for the cation-independent M6P receptor is an oligosaccharide containing two M6P groups (i.e., a bis-phosphorylated oligosaccharide). Since a bis-phosphorylated oligosaccharides binds with an affinity 3500-fold higher than a monophosphorylated oligosaccharides, virtually all the high-affinity binding of a lysosomal enzyme to the M6P receptor will result from the content of bis-phosphorylated oligosaccharides (Tong, P. Y., Gregory, W., and Kornfeld, S. (1989)). “Ligand interactions of the cation-independent mannose 6-phosphate receptor. The stoichiometry of mannose 6-phosphate binding.” Journal of Biological Chemistry 264: 7962-7969). It is therefore appropriate to use the content of bis-phosphorylated oligosaccharides to compare the binding potential of different preparations of GBA.

[0043] The phrase “highly phosphorylated GBA” as used herein refers to GBA which contains more bis-phosphorylated oligosaccharides compared to known naturally occurring or recombinant GBA. Preferably, GBA contains at least 5% bis-phosphorylated oligosaccharides compared to GBA not treated with the GlcNAc-phosphotransferase described herein. More preferably, the “highly phosphorylated GBA” has at least 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%,14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%,23%, 24%, 25%, 26%, 27%, 28%, 29%,30%, 40%,45%, 50%, 60%, 70%, 80%, 85%, 90%, 95%, 100% bis-phosphorylated oligosaccharides and all values and ranges there between. This highly phosphorylated GBA have a higher affinity for the M6P receptor and are therefore more efficiently taken into the cell by plasma membrane receptors.

[0044] The phrase “highly phosphorylated GBA” as used herein refers to GBA which is more highly phosphorylated naturally or recombinant GBA. Preferably, highly phosphorylated GBA contains at least 5% of the molecules that bind with high affinity to a mannose 6 phosphate column. More preferably, the “highly phosphorylated GBA” has at least 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%,14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%,23%, 24%, 25%, 26%, 27%, 28%, 29%,30%, 40%,45%, 50%, 60%, 70%, 80%, 85%, 90%, 95%, 100% mannose 6 receptor binding high affinity GBA and all values and ranges there between. This highly phosphorylated GBA have a higher affinity for the M6P receptor and are therefore more efficiently taken into the cell by plasma membrane receptors.

[0045] The high-affinity ligand for the cation-independent M6P receptor is an oligosaccharide containing two M6P groups (i.e., a bis-phosphorylated oligosaccharide). Since a bisphosphorylated oligosaccharides binds with an affinity 3500-fold higher than a monophosphorylated oligosaccharides, virtually all the high-affinity binding of a lysosomal enzyme to the M6P receptor will result from the content of bis-phosphorylated oligosaccharides (Tong, P. Y., Gregory, W., and Kornfeld, S. (1989)). “Ligand interactions of the cation-independent mannose 6-phosphate receptor. The stoichiometry of mannose 6-phosphate binding.” Journal of Biological Chemistry 264: 7962-7969). It is therefore appropriate to use the content of bis-phosphorylated oligosaccharides to compare the binding potential of different preparations of lysosomal enzymes.

[0046] In addition to measuring the highly phosphorylated GBA using the M6P binding assay described herein, the extent of phosphorylation and thus, uptake of the highly phosphorylated GBA can be measured using a fibroblast uptake protocol.

[0047] This fibroblast uptake protocol may be conducted as follows:

[0048] Enzyme preparation: Dilute an enzyme preparation in PBS (pH 7.2) and plate in triplicate into a black 96well plate (25 μl/well). The amount of purified enzyme is equivalent to 1 million counts, this is the amount added per well to the cells in the uptake assay.

[0049] Uptake assay. Using a confluent flask of fibroblasts (for example, GM00372, GM 04394 and GM 07968, which are Gaucher disease type I and GM01260, and GM 00877 Gaucher disease type II, and GM 10915 Gaucher disease, type uncertain, all are accession numbers at the Coriel Cell Repository, 401 Haddon Avenue, Camden, N.J. 08103) aspirate the medium and wash cells in 10 ml of DPBS, aspirate DPBS, add 3 ml of trypsin to the flask and rock to coat the cells, incubate for approximately 5 minutes. Resuspend in 7 ml Dulbeccos' modified essential media (DMEM), final volume should be 10 ml. Count the number of cells in a hemacytometer and dilute suspension to produce 150,000 cells/ml using DMEM, plate 3 ml of cell suspension into four 60 mm culture dishes and incubate at 37° C. overnight. The next day change the medium on each dish to uptake medium (containing Ham's F-12, 10% Heat-Inactivated FBS, 3 mM Pipes, pH 6.7). Two hours later add 15 μl mannose-6-phosphate to each of two dishes. Two hours later 1 million fluorescent counts of the enzyme is added to the one of the two dishes having mannose-6-phosphate and the other to a dish containing only PBS (if the enzymatic activity is a fluorescent assay, e.g., using 4-MU-β-glucose, as described herein, it is preferable to employ a dark or black plate; and if a colorometric assay, e.g., using BCA, it is preferable to employ a clear plate). Thus, the dishes for each enzyme sample to be tested are (1) PBS only, (2) PBS and mannose-6-phosphate, (3) Enzyme, (4) Enzyme and mannose-6-phosphate, and (5) normal human fibroblasts. The dishes are incubated at 37° C. for 16 hours. Remove the medium and save in 15 ml conical tubes, wash the cells 3 times with DPBS, and harvest the cells with a cell scraper. Suspend cells in 1 ml DPBS and save in a 1.5 ml microcentrifuge tube. Centrifuge the cells at 14,000 RPM for 2 minutes, aspirate the DPBS and resuspend in 1 ml DPBS. Vortex the cells and repeat the centrifugation, and DPBS washing step 4 times. After the fourth washing step, lyse the cells in 110 μl 0.25% Triton X-100 at room temperature for 1 to 2 hours. The amount of protein present or the enzymatic activity is measured using the methods described herein or those commonly employed in the art.

[0050] The activity of GBA can be assayed using the following method. Prepare a 16 mM 4-methylumbelliferyl β-D-glucoside (4-MU-β-Glu) in 4×CP buffer (prepared by mixing 43.5 ml of 0.1M citric acid (21.01 g citric acid/liter) and 0.2 M Disodium Phosphate (28.4 g sodium phosphate anhydrous/liter)) mix and warm to 42° C. until the solution becomes clear. Then prepare an assay buffer by mixing equal parts of the 16 mM 4-MU-β-Glu and 4×CP Buffer and 1% TC/TX (made by dissolving, 1 gram of taurochoric acid, sodium salt in water and adjust volume to 90 ml, then add 10 ml of 10% Triton X-100). In a 96 well black pandex plate, add sample to assay (Adjust volume to 25 μl with water) and add 25 μl of assay buffer, incubate at 37° C. for 1 hour and stop the reaction by adding 125 mL of 1 M Glycine-NaOH, pH 10.5. Measure the amount of 4-MU released from 4-Mu-β-Glu by comparing fluorescence at Ex=360, Em=455 with a standard curve of free 4-MU.

[0051] To distinguish GBA activity from other types of beta glucocerbrosidase, other substrates, such as 4-MU-cellobioside, 4-MU-cellotrioside may be employed as well.

[0052] To determine the extent to which the GBA is phosphorylated, the GBA pre and post-phosphorylation treatment can be assayed by binding to Mannose-6-phosphate as described herein and in Hoflack et al (1985) J Bio Chem 260:12008-120014.

[0053] Recombinant expression vectors containing a nucleic acid sequence encoding GBA, GlcNAc phosphotransferase (soluble and insoluble forms) and/or phosphodiester α-GlcNAcase can be prepared using well-known techniques. The expression vectors include a DNA sequence operably linked to suitable transcriptional or translational regulatory nucleotide sequences such as those derived from mammalian, microbial, viral, or insect genes. Examples of regulatory sequences include transcriptional promoters, operators, enhancers, mRNA ribosomal binding sites, and appropriate sequences which control transcription and translation initiation and termination. Nucleotide sequences are “operably linked” when the regulatory sequence functionally relates to the DNA sequence for the appropriate enzyme. Thus, a promoter nucleotide sequence is operably linked to a GlcNAc-phosphotransferase DNA sequence if the promoter nucleotide sequence controls the transcription of the appropriate DNA sequence.

[0054] The ability to replicate in the desired host cells, usually conferred by an origin of replication and a selection gene by which transfectants are identified, may additionally be incorporated into the expression vector.

[0055] In addition, sequences encoding appropriate signal peptides that are not naturally associated with GBA, GlcNAc phosphotransferase (soluble and insoluble forms) and/or phosphodiester α-GlcNAcase can be incorporated into expression vectors. For example, a DNA sequence for a signal peptide (secretory leader) may be fused in-frame to the enzyme sequence so that the enzyme is initially translated as a fusion protein comprising the signal peptide. A signal peptide that is functional in the intended host cells enhances extracellular secretion of the appropriate polypeptide. The signal peptide may be cleaved from the polypeptide upon secretion of enzyme from the cell.

[0056] Suitable host cells for expression of the GBA, GlcNAc phosphotransferase and/or phosphodiester α-GlcNAcase include prokaryotes, yeast, archae, and other eukaryotic cells. Preferred cells include insect cells and eukaryotic cells, examples of which include, but not limited to: SF9, SF+, CHO, Hela, 293T NS0, etcetera.

[0057] Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular hosts are well known in the art, e.g., Pouwels et al. Cloning Vectors: A Laboratory Manual, Elsevier, N.Y. (1985). The vector may be a plasmid vector, a single or double-stranded phage vector, or a single or double-stranded RNA or DNA viral vector. Such vectors may be introduced into cells as polynucleotides, preferably DNA, by well-known techniques for introducing DNA and RNA into cells. The vectors, in the case of phage and viral vectors also may be and preferably are introduced into cells as packaged or encapsulated virus by well-known techniques for infection and transduction. Viral vectors may be replication competent or replication defective. In the latter case viral propagation generally will occur only in complementing host cells. Cell-free translation systems could also be employed to produce the enzymes using RNAs derived from the present DNA constructs.

[0058] Expression vectors for use in host cells generally comprise one or more phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a gene encoding a protein that confers antibiotic resistance or that supplies an autotrophic requirement. Examples of useful expression vectors for prokaryotic host cells include those derived from commercially available plasmids such as the cloning vector pBR322 (ATCC 37017). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides simple means for identifying transfected cells. To construct an expression vector using pBR322, an appropriate promoter and a DNA sequence are inserted into the pBR322 vector.

[0059] Other commercially available vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEM1 (Promega Biotec, Madison, Wis., USA).

[0060] Promoter sequences commonly used for recombinant prokaryotic host cell expression vectors include β-lactamase (penicillinase), lactose promoter system (Chang et al., Nature275:615, (1978); and Goeddel et al., Nature 281:544, (1979)), tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids Res. 8:4057, (1980)), and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412 (1982)).

[0061] Yeasts useful as host cells in the present invention include those from the genus Saccharomyces, Pichia, K. Actinomycetes and Kluyveromyces. Yeast vectors will often contain an origin of replication sequence from a 2μ yeast plasmid, an autonomously replicating sequence (ARS), a promoter region, sequences for polyadenylation, sequences for transcription termination, and a selectable marker gene. Suitable promoter sequences for yeast vectors include, among others, promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073, (1980)) or other glycolytic enzymes (Holland et al., Biochem. 17:4900, (1978)) such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Other suitable vectors and promoters for use in yeast expression are further described in Fleer et al., Gene, 107:285-195 (1991). Other suitable promoters and vectors for yeast and yeast transfectation protocols are well known in the art.

[0062] Yeast transfectation protocols are known to those of skill in the art. One such protocol is described by Hinnen et al., Proceedings of the National Academy of Sciences USA, 75:1929 (1978). The Hinnen protocol selects for Trp⁺ transfectants in a selective medium, wherein the selective medium consists of 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 μg/ml adenine, and 20 μg/ml uracil.

[0063] Mammalian or insect host cell culture systems well known in the art could also be employed to express recombinant GBA, GlcNAc phosphotransferase and/or phosphodiester αa-GlcNAcase polypeptides, e.g., Baculovirus systems for production of heterologous proteins in insect cells (Luckow and Summers, Bio/Technology 6:47 (1988)) or Chinese hamster ovary (CHO) cells for mammalian expression may be used. Transcriptional and translational control sequences for mammalian host cell expression vectors may be excised from viral genomes. Commonly used promoter sequences and enhancer sequences are derived from Polyoma virus, Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from the SV40 viral genome may be used to provide other genetic elements for expression of a structural gene sequence in a mammalian host cell, e.g., SV40 origin, early and late promoter, enhancer, splice, and polyadenylation sites. Viral early and late promoters are particularly useful because both are easily obtained from a viral genome as a fragment which may also contain a viral origin of replication. Exemplary expression vectors for use in mammalian host cells are well known in the art.

[0064] The GBA, GlcNAc phosphotransferase and/or phosphodiester α-GlcNAcase of the present invention may, when beneficial, be expressed as a fusion protein that has the enzyme attached to a fusion segment. The fusion segment often aids in protein purification, e.g., by permitting the fusion protein to be isolated and purified by affinity chromatography. Fusion proteins can be produced by culturing a recombinant cell transfected with a fusion nucleic acid sequence that encodes a protein including the fusion segment attached to either the carboxyl and/or amino terminal end of the enzyme. Preferred fusion segments include, but are not limited to, glutathione-S-transferase, β-galactosidase, a poly-histidine segment capable of binding to a divalent metal ion, and maltose binding protein. In addition, the HPC-4 epitope purification system may be employed to facilitate purification of the enzymes of the present invention. The HPC-4 system is described in U.S. Pat. No. 5,202,253, the relevant disclosure of which is herein incorporated by reference.

[0065] According to the present invention, isolated enzymes may be produced by the recombinant expression systems described above. The method comprises culturing a host cell transfected with an expression vector comprising a DNA sequence that encodes the enzyme under conditions sufficient to promote expression of the enzyme. The enzyme is then recovered from culture medium or cell extracts, depending upon the expression system employed. As is known to the skilled artisan, procedures for purifying a recombinant protein will vary according to such factors as the type of host cells employed and whether or not the recombinant protein is secreted into the culture medium. When expression systems that secrete the recombinant protein are employed, the culture medium first may be concentrated. Following the concentration step, the concentrate can be applied to a purification matrix such as a gel filtration medium. Alternatively, an anion exchange resin can be employed, e.g., a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, dextran, cellulose, or other types commonly employed in protein purification. Also, a cation exchange step can be employed. Suitable cation exchangers include various insoluble matrices comprising sulfopropyl or carboxymethyl groups. Further, one or more reversed-phase high performance liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media (e.g., silica gel having pendant methyl or other aliphatic groups) can be employed to further purify the enzyme. Some or all of the foregoing purification steps, in various combinations, are well known in the art and can be employed to provide an isolated and purified recombinant protein.

[0066] Recombinant protein produced in bacterial culture is usually isolated by initial disruption of the host cells, centrifugation, extraction from cell pellets if an insoluble polypeptide, or from the supernatant fluid if a soluble polypeptide, followed by one or more concentration, salting-out, ion exchange, affinity purification, or size exclusion chromatography steps. Finally, RP-HPLC can be employed for final purification steps. Host cells can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.

[0067] The invention provides methods of phosphorylating GBA and the thus obtained phosphorylated GBA enzymes. GBA is produced by treating the high mannose GBA with α/β GlcNAc-phosphotransferase (soluble or insoluble, as well as mixed α/β/γ GlcNAc phosphotransferase, with α/β GlcNAc phosphotransferase) which catalyzes the transfer of N-acetylglucosamine-1-phosphate from UDP-GlcNAc to the 6′ position of 1,2-linked or other mannoses on the hydrolase. Preferably, the GlcNAc-phosphotransferase is the soluble α/β GlcNAc-phosphotransferase described herein. Also it was shown that the γ subunit of the GlcNAc-phosphotransferase is not required.

[0068] Methods for treating GBA with the enzymes of the present invention are within the skill of the artisan. Generally, the GBA is at a concentration of about 10 mg/ml and GlcNAc-phosphotransferase is present in a concentration of about 1 to about 10 million units per milliliter. The enzymes are incubated at about 20° C. for about 48 hours or longer in the presence of a buffer that maintains the pH at about 6-7 and any stabilizers or coenzymes required to facilitate the reaction. Then, phosphodiester α-GlcNAcase can be added to the system to a concentration of about 250,000 to 1,000,000 units/mL and the system is allowed to incubate for about 6 or more hours. The modified GBA enzyme having highly phosphorylated oligosaccharides is then recovered by conventional means.

[0069] In a preferred embodiment, the GBA at 10 mg/ml is incubated in 50 mm Sodium Acetate pH 6.5, 20 mM MnCl₂, 0.3 mM (300 μM) with GlcNAc phosphotransferase at 1 to 10 million units/ml at 20° C. for 48 hours or longer,. The GBA is then treated with phosphodiester-α GlcNAcase for 6 hours. The modified enzyme is then repurified by conventional chromatography.

[0070] High mannose GBA for treatment according to the present invention can be obtained from any convenient source, e.g., by isolating and purifying naturally occurring enzymes or by recombinant techniques for the production of proteins.

[0071] High mannose GBA can be prepared by expressing the DNA encoding the GBA in any host cell system that generates a oligosaccharide modified protein having high mannose structures, e.g., yeast cells, insect cells, other eukaryotic cells, transfected Chinese Hamster Ovary (CHO) host cells, or other mammalian cells.

[0072] In one embodiment, high mannose GBA is produced using mutant yeast that are capable of expressing peptides having high mannose structures. These yeast include the mutant S. cervesiae ochl, mnnl (Nakanishi-Shindo, Y., Nakayama, K. I., Tanaka, A., Toda, Y. and Jigami, Y. (1993). “Structure of the N-linked oligosaccharides that show the complete loss of α-1,6-polymannose outer chain from ochl, ochl mnnl, and ochl mnnl alg3 mutants of Saccharomyces cerevisiae.” Journal of Biological Chemistry 268: 26338-26345).

[0073] Preferably, high mannose GBA is produced using over-expressing transfected insect, CHO, or other mammalian cells that are cultured in the presence of certain inhibitors. Normally, cells expressing lysosomal enzymes secrete enzymes that contains predominantly sialylated complex type glycans that do not serve as a substrate for GlcNAc-phosphotransferase and therefore cannot be modified to use the M6P receptor.

[0074] According to the present invention, transfected cells containing DNA that expresses a recombinant GBA can be manipulated so that the cells secrete high mannose GBA that can be modified according to the above method. In this method, transfected cells are cultured in the presence of α 1,2-mannosidase inhibitors and the high mannose recombinant GBA is recovered from the culture medium. Inhibiting α 1,2-mannosidase prevents the enzyme from trimming mannoses and forces the cells to secrete glycoproteins having the high mannose structure. High mannose GBA is recovered from the culture medium using known techniques and treated with α/β GlcNAc-phosphotransferase and phosphodiester α-GlcNAcase according to the method herein to produce GBA that has M6P and can therefore bind to membrane M6P receptors and be taken into the cell having the M6P receptor. Preferably, the cells are CHO cells and the GBA is secreted with the MAN7(D₂D₃) structure, more preferably the cells are CHO K1 cells, and even more preferred are CHO K1 cells that are GnT I deficient.

[0075] In a preferred embodiment, recombinant human GBA is prepared by culturing CHO cells secreting GBA in Dulbecco's modified essential media (DMEM) modified by the addition of an alpha 1,2-mannosidase inhibitor. Isolation of GBA from the media followed by digestion with either N-glycanase or endoglycosidase-H demonstrates that in the presence of the alpha 1,2-mannosidase inhibitor the GBA retains high mannose structures rather than the complex structures found on a preparation secreted in the absence of the inhibitor. The isolated GBA bearing high mannose structures is then purified to homogeneity, preferably by chromatography beginning with ion exchange chromatography on ConA-Sepharose, followed toyopearl butyl 650M, Phenyl-Sepharose or octyl Sepharose. The purified GBA is then treated in vitro with α/βGlcNAc-phosphotransferase to convert specific mannoses to GlcNAc-phospho-mannose diesters. The GlcNAcphosphomannose diesters are then converted to M6P groups by treatment with phosphodiester a GlcNAcase.

[0076] Any α1,2-mannosidase inhibitor can function in the present invention. Preferably, the inhibitor is selected from the group consisting of deoxymannojirimycin (dMM), kifunensine, D-Mannonolactam amidrazone, and N-butyl-deoxymannojirimycin. Most preferably the inhibitor is deoxymannojimycin and/or kifunensine.

[0077] The present invention also provides methods for the treatment of Gaucher's disease by administering an effective amount of the highly phosphorylated GBA of the present invention to a patient diagnosed with the Gaucher's disease. As used herein, being diagnosed with Gaucher's includes pre-symptomatic phases of the disease and the various symptomatic Gaucher's disease. Typically, the pre-symptomatic patient will be diagnosed with Gaucher's disease by means of a genetic analysis known to the skilled artisan.

[0078] As discussed above, the administration of the highly phosphorylated GBA will target tissues, for example, lung and bone tissues, that posses the M6P receptor. Thus, the present highly phosphorylated GBA will be targeted to tissues that are not normally targeted using recombinant or naturally purified GBA thereby resulting in an increased positive effect on the patient suffering from Gaucher's disease.

[0079] In one embodiment of the present invention is a method of treating lung or lung tissue in patients with Gaucher's by administering the highly phosphorylated GBA to said patient. In another embodiment of the present invention the highly phosphorylated GBA is administered to bone or bone tissue of Gaucher's patients.

[0080] In one embodiment, the present invention provides a method of treating Gaucher's disease using a combination of both highly phosphorylated GBA prepared in accordance with the present invention and the GBA not so prepared, i.e., having little or no phosphorylation. Therefore, the combination of both types of GBA enzymes would substantially increase the tissues treated by the enzyme replacement therapy as described herein.

[0081] While dosages may vary depending on the disease and the patient, highly phosphorylated GBA is generally administered to the patient in amounts of from about 0.1 to about 1000 milligrams per kg of patient per month, preferably from about 1 to about 500 milligrams per kg of patient per month. The highly phosphorylated GBA of the present invention is taken into the cell expressing the M6P receptor than the naturally occurring or less phosphorylated GBA and are therefore effective for the treatment of Gaucher's disease. Amongst various patients the severity and the age at which the disease presents itself may be a function of the amount of residual GBA enzyme that exists in the patient. As such, the present method of treating Gaucher's diseases includes providing the highly phosphorylated GBA at any or all stages of disease progression.

[0082] The GBA enzyme may be administered by any convenient means, conventionally known to those of ordinary skill in the art. For example, the enzyme may be administered in the form of a pharmaceutical composition containing the enzyme and a pharmaceutically acceptable carrier or by means of a delivery system such as a liposome or a controlled release pharmaceutical composition. The term “pharmaceutically acceptable” refers to molecules and compositions that are physiologically tolerable and do not typically produce an allergic or similar unwanted reaction such as gastric upset or dizziness when administered. Preferably, “pharmaceutically acceptable” means approved by a regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopoeia or other generally recognized pharmacopoeia for use in animals, preferably humans. The term “carrier” refers to a diluent, adjuvant, excipient, or vehicle with which the compound is administered. Such pharmaceutical carriers can be sterile liquids, such as saline solutions, dextrose solutions, glycerol solutions, water and oils emulsions such as those made with oils of petroleum, animal, vegetable, or synthetic origin (peanut oil, soybean oil, mineral oil, or sesame oil). Water, saline solutions, dextrose solutions, and glycerol solutions are preferably employed as carriers, particularly for injectable solutions.

[0083] The enzyme or the composition may be administered by any standard technique compatible with enzymes or their compositions. For example, the enzyme or composition can be administered parenterally, transdermally, or transmucosally, e.g., orally or nasally. Preferably, the enzyme or composition is administered by intravenous injection.

[0084] The following Examples provide an illustration of embodiments of the invention and should not be construed to limit the scope of the invention which is set forth in the appended claims. In the following Examples, all methods described are conventional unless otherwise specified.

EXAMPLES

[0085] Human Acid β-Glucocerbrosidase

[0086] A mammalian expression vector was constructed by subcloning a cDNA for human acid β-glucocerbrosidase (GBA) into the EcoR I site of the pcDNA6/V5/His-A (Invitrogen) construct. This plasmid was renamed pDH1. A cDNA of human GBA was subcloned into the XbaI and EcoRI sites of the pEE14 vector (Lonza Biologics) and named pCC4.

[0087] Human GlcNAc-Phosphotransferase

[0088] Plasmid pMK 163 was constructed to express recombinant soluble human GlcNAc-phosphotransferase. GlcNAc-phosphotransferase is an enzyme that consists of four subunits; α2β2. The α and β subunits are encoded on a single mRNA and proteolytically cleaved after translation. The wild type human GlcNAc-phosphotransferase is associated with the Golgi apparatus through transmembrane domains at the N-terminal of the α subunit, and C-terminal of the β subunit. By the following modification, a cDNA encoding a soluble form of recombinant human GlcNAc-phosphotransferase was made. The α/β subunit was modified from the wild type sequence as follows; (1) 24 amino acids located on the N-terminal of the α subunit, which is a putative signal/transfer transmembrane domain, were replaced with the immunoglobulin leader sequence (METDTLLLWVLLLWVPGSTG-SEQ ID NO:22) and the HPC4 epitope (DEDQVDPRLIDGK-SEQ ID NO:23)(Rezaie, A. R et. al (1992) Protein Expr Purif, 3, 453-60) (2) 47 amino acids at the C-terminus were removed by replacing the codons encoding these amino acids by a stop codon. (3) 6 amino acids just before the α/β cleavage site were replaced with RARYKR (SEQ ID NO:27)which is a cleavage sequence for furin (Nakayama, K., (1997) Biochem.J, 327, 625-635), which is a proprotein processing enzyme. The plasmid pMK 155 uses a pEE14 (Lonza Biologics) backbone to express α/β subunits, thus modified.

[0089] Human N-acetylglucosamine-1-Phosphodiester α-N-acetylglucosaminidase (UCE)

[0090] Plasmid pKB 6 was constructed to express recombinant soluble uncovering enzyme. The molecular cloning and expression of wild type uncovering enzyme is described in Kornfeld et al. ((1999) Biochem J, 274, 32778-32785). Uncovering enzyme consists of four identical subunits arranged as two disulfide-linked homodimers. The wild type human uncovering enzyme is associated with the Golgi apparatus through a transmembrane domain at the C-terminal end of the polypeptide. A cDNA encoding a soluble form of recombinant human uncovering enzyme was made by replacing 68 amino acids at the C-terminal with a HPC4 epitope tag (EDQVDPRLIDGKD-(SEQ ID NO:3)). The modified cDNA encoding soluble rh-UCE then was subcloned into pEE 14 (Lonza Biologics).

[0091] GBA Transfection

[0092] The cells were cultured in 16% CO2 to maintain a slightly acidic culture medium. In order to express the GBA protein the pDH1 plasmid was transienty transfected into 293T cells. Four Nunc cell factories (6320 cm²) were seeded with approximately 2×10⁸ cells each in Dulbecco's Modification of Eagles Medium (DMEM) containing 10% fetal bovine serum (FBS). In addition kifunensine, a glycosidase inhibitor which acts on the N-linked oligosaccharides processing pathway was added to 5 μg/ml. The cells were transfected using FuGene 6 (Roche) according to the manufacturers instruction. The media from the cells were harvested approximately 96 hours post-transfection.

[0093] GlcNAc-phosphotransferase Transfection

[0094] In order to develop a stable cell line secreting soluble human GlcNAc-phosphotransferase, the glutamine synthetase (GS) expression system (Lonza Biologics) was utilized (Bebbington, C. R., (1998) Current Protocols in Molecular Biology, 16(14),7-13). The pMK 155 plasmid was transfected into CHO-K1 cells and the media from clones which survived the methionine sulfoximine (MSX) selection were assayed for GlcNAc-phosphotransferase activity (Reitman, M. L., et.al (1984) Methods Enzymol. 107, 163-172). A clone expressing high levels of GlcNAc-phosphotransferase in the media was selected as the source of GlcNAc-phosphotransferase. In addition, a stable line using pCC4 was made under the same conditions as described above.

[0095] Phosphodiester α-GlcNAcase Transfection

[0096] In order to develop a stable cell line secreting soluble human uncovering enzyme, glutamine synthetase (GS) system (Lonza Biologics) was utilized. The plasmid pKB 6 was transfected into CHO-K1 cells and the media from clones which survived the MSX selection were assayed for UCE activity (9). A clone expressing high levels of UCE was selected as the source of uncovering enzyme.

[0097] Purification of GBA

[0098] The GBA purification scheme consisted of concentrating the harvested media 10-fold from 8 liters to 0.8 liters with a Millipore Pelicon concentrator and then incubating the concentrated media with Con A sepharose (Pharmacia) for approximately 3 hours at 4° C. The Con A sepharose was then packed into a column, washed with 25 mM Tris-HCl pH 6.5, 0.5M NaCl, 1 mM MnCl₂, 1 mM CaCl₂ and eluted with 25 mM Tris-HCl, pH 6.5, 0.5 M NaCl, 1M α-methyl glucoside. The fractions were assayed for GBA and the peak fractions were pooled. The GBA was then loaded onto a Toyopearl Butyl 650M (TosoHass) column and eluted with a 10 column volume gradient of 0-60% ethylene glycol followed by 100% ethylene glycol. The fractions were again assayed for GBA activity. The peak fractions were pooled and dialyzed overnight at 4° C. in 50 mM sodium acetate pH 5.5, 150 mM NaCl.

[0099] GBA activity was measured by using 4-methyl-umberyferyl-β-glucoside (4MU-β-Glc, Sigma) as a substrate. The amount of GBA which converts 1 nmol of 4MU-β-Glc into 1 .nmol each of 4-methyl-umberyferone (4MU) and glucoside at 37° C. per hour was defined as 1 unit.

[0100] GBA

4MU-β-Glc→4MU+Glc

[0101] Briefly, samples were incubated with 4 mM 4MU-β-Glc in 1×assay buffer in the presence of 0.25% (V/V) Triton X-100 and 0.25% (W/V) sodium taurocholate in 40 μl at 37° C. for 30 min to 2 hrs. A 4×assay buffer, pH 5.5, was made by mixing 43.5 ml of 0.1 M citric acid and 0.2 M disodium phosphate solution. The reaction was stopped by adding 100 μl of 1 M Glycine/NaOH, pH 10.5 solution. The amount of 4-MU converted. during the incubation was measured by detecting fluorescence at excitation wavelength=360 nm, emission wavelength=455 nm. Assay results were compared to a standard curve obtained from known amount (0, 25, 50, 100, 200, and 400 pmol) of 4-MU (Sigma).

[0102] Purification of GlcNAc-phosphotransferase

[0103] HPC4 (Oklahoma Medical Research Foundation; OMRF) was coupled to Ultralink Biosupport Medium (Pierce) following the manufacturer's instructions. The HPC4:ultralink resin was equilibrated in 50 mM Tris-HCl, 150 mM NaCl and 2 mM CaCl₂. Cell culture medium from the CHO-K1 cells was incubated with the HPC4:ultralink resin for 16 hrs at 4° C. to capture the GlcNAc-phosphotransferase which contained the epitope tag for the HPC4. The bound GlcNAc-phosphotransferase was eluted with 50 mM Tris-HCl, 150 mM NaCl and 5 mM EGTA, concentrated and buffer-exchanged to 50 mM Tris-HCl, 150 mM NaCl and 5 mM MgCl₂. The amount of GlcNAc-phosphotransferase which can transfer 1 pmol per hour of GlcNAc-phosphate from UDP-GlcNAc (donor) to α-methyl mannoside (acceptor) is defined as 1 unit (Reitman et al Meth. Enzym. 107:163-172 (1984)).

[0104] Purification of phosphodiester α-GlcNAcase

[0105] HPC4 (OMRF) was coupled to Ultralink Biosupport Medium (Pierce) following the manufacturer's instructions. The HPC4:ultralink resin was equilibrated in 50 mM Tris-HCl, 150 mM NaCl and 2 mM CaCl₂. Cell culture medium from the transfected CHO-K1 cells was incubated with the HPC4:ultralink resin for 16 hrs at 4° C. to capture the phosphodiester α-GlcNAcase which contained the epitope tag for HPC4. The the phosphodiester α-GlcNAcase was eluted with 50 mM Tris-HCl, 150 mM NaCl and 5 mM EDTA and. Recombinant human the phosphodiester α-GlcNAcase thus prepared was used for uncovering of phosphorylated acid-β-glucocerbrosidase. The amount of the phosphodiester α-GlcNAcase that can remove 1 nmol of GlcNAc per hour from GlcNAc-α-P-ManαMe is defined as 1 unit (Mullis, K., et al (1994) Biochem. J, 269, 1718-1726).

[0106] Preparation of Highly Phosphorylated GBA

[0107] Partially purified GBA (1462 units) was phosphorylated by incubating with GlcNAc-phosphotransferase (100,000 unit) in 50 mM sodium acetate (pH 6.5), 20 mM MgCl₂, and 150 μM UDP-GlcNAc at 20° C. for 47 hrs. Next, 1000 units of uncovering enzyme and a phosphatase inhibitor cocktail II (Sigma) were added and the reaction was incubated an additional 6.5 hours at 20° C. Following the uncovering reaction β-glycerophosphate was added to 5 mM to inhibit phosphatase activity. Next, the HP-GBA was examined for its binding efficiency to a mannose-6-phosphate receptor column.

[0108] Mannose-6-Phosphate Receptor (M6P-R) Binding Pre- and Post-Phosphorylation

[0109] Mannose 6-phosphate (M6P) receptor was purified from bovine liver and coupled to a NHS-Sepharose 4B FF resin (Hoflack et al (1985) J Bio Chem 260:12008-120014). The resin was then packed in a 2 ml column and equilibrated with a buffer consisting of 50 mM Imidazole, 150 mM NaCl, 2 mM EDTA, 5 mM β-glycerophosphate, 0.05% v/v Triton X-100, 0.02% v/v sodium azide, at a flow rate of 0.1 ml/min. The GBA was injected onto the M6PR column and then a linear gradient of increasing M6P was applied after the column had been washed with 5.5 column volumes of the buffer mentioned above. A gradient, 0-1 mM M6P was allowed to develop over the next 10 ml at which time M6P was increased to 5 mM and maintained for 5 mls. At this time the column was returned to its initial conditions. During the entire chromatograph, 250 μl fractions were collected and subsequently assayed for GBA activity. The fluorescence of each well was then graphed and overlaid with the M6P gradient applied to the column. The elution of GBA is positively correlated to the amount of phosphorylated mannose present on the enzyme.

[0110] Following the GBA purification, a aliquot (˜250 U) was loaded onto a M6PR column, a M6P gradient was run, fractions collected and GBA activity assayed. As illustrated in FIG. 1A over 99% of the sample was eluted prior to the start of the M6P gradient suggesting there was no detectable phosphorylated GBA. As illustrated in FIG. 1B, there was a pronounced shift in the elution of phosphorylated GBA. Seventy-seven percent of the GBA activity was eluted after the start of the M6P gradient. Of that, 35% was eluted only after the addition of 5 mM M6P which suggest a phosphorylated GBA molecule.

[0111] GBA was poorly phosphorylated under culture conditions. However, upon treatment with GlcNAc phosphotransferase and phosphodiester α-GlcNAcase, a highly phosphorylated GBA was obtained.

[0112] Obviously, numerous modifications and variations on the present invention are possible in light of the above teachings. It is therefore to be understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.

1 27 1 3600 DNA hybrid 1 atggagacag acacactcct gctatgggta ctgctgctct gggttccagg ttccactggt 60 gacgaagatc aggtagatcc gcggttaatc gacggtaagc ttagccgaga tcaataccat 120 gttttgtttg attcctatag agacaatatt gctggaaagt cctttcagaa tcggctttgt 180 ctgcccatgc cgattgacgt tgtttacacc tgggtgaatg gcacagatct tgaactactg 240 aaggaactac agcaggtcag agaacagatg gaggaggagc agaaagcaat gagagaaatc 300 cttgggaaaa acacaacgga acctactaag aagagtgaga agcagttaga gtgtttgcta 360 acacactgca ttaaggtgcc aatgcttgtc ctggacccag ccctgccagc caacatcacc 420 ctgaaggacc tgccatctct ttatccttct tttcattctg ccagtgacat tttcaatgtt 480 gcaaaaccaa aaaacccttc taccaatgtc tcagttgttg tttttgacag tactaaggat 540 gttgaagatg cccactctgg actgcttaaa ggaaatagca gacagacagt atggaggggc 600 tacttgacaa cagataaaga agtccctgga ttagtgctaa tgcaagattt ggctttcctg 660 agtggatttc caccaacatt caaggaaaca aatcaactaa aaacaaaatt gccagaaaat 720 ctttcctcta aagtcaaact gttgcagttg tattcagagg ccagtgtagc gcttctaaaa 780 ctgaataacc ccaaggattt tcaagaattg aataagcaaa ctaagaagaa catgaccatt 840 gatggaaaag aactgaccat aagtcctgca tatttattat gggatctgag cgccatcagc 900 cagtctaagc aggatgaaga catctctgcc agtcgttttg aagataacga agaactgagg 960 tactcattgc gatctatcga gaggcatgca ccatgggttc ggaatatttt cattgtcacc 1020 aacgggcaga ttccatcctg gctgaacctt gacaatcctc gagtgacaat agtaacacac 1080 caggatgttt ttcgaaattt gagccacttg cctaccttta gttcacctgc tattgaaagt 1140 cacgttcatc gcatcgaagg gctgtcccag aagtttattt acctaaatga tgatgtcatg 1200 tttgggaagg atgtctggcc agatgatttt tacagtcact ccaaaggcca gaaggtttat 1260 ttgacatggc ctgtgccaaa ctgtgccgag ggctgcccag gttcctggat taaggatggc 1320 tattgtgaca aggcttgtaa taattcagcc tgcgattggg atggtgggga ttgctctgga 1380 aacagtggag ggagtcgcta tattgcagga ggtggaggta ctgggagtat tggagttgga 1440 cagccctggc agtttggtgg aggaataaac agtgtctctt actgtaatca gggatgtgcg 1500 aattcctggc tcgctgataa gttctgtgac caagcatgca atgtcttgtc ctgtgggttt 1560 gatgctggcg actgtgggca agatcatttt catgaattgt ataaagtgat ccttctccca 1620 aaccagactc actatattat tccaaaaggt gaatgcctgc cttatttcag ctttgcagaa 1680 gtagccaaaa gaggagttga aggtgcctat agtgacaatc caataattcg acatgcttct 1740 attgccaaca agtggaaaac catccacctc ataatgcaca gtggaatgaa tgccaccaca 1800 atacatttta atctcacgtt tcaaaataca aacgatgaag agttcaaaat gcagataaca 1860 gtggaggtgg acacaaggga gggaccaaaa ctgaattcta cggcccagaa gggttacgaa 1920 aatttagtta gtcccataac acttcttcca gaggcggaaa tcctttttga ggatattccc 1980 aaagaaaaac gcttcccgaa gtttaagaga catgatgtta actcaacaag gagagcccag 2040 gaagaggtga aaattcccct ggtaaatatt tcactccttc caaaagacgc ccagttgagt 2100 ctcaatacct tggatttgca actggaacat ggagacatca ctttgaaagg atacaatttg 2160 tccaagtcag ccttgctgag atcatttctg atgaactcac agcatgctaa aataaaaaat 2220 caagctataa taacagatga aacaaatgac agtttggtgg ctccacagga aaaacaggtt 2280 cataaaagca tcttgccaaa cagcttagga gtgtctgaaa gattgcagag gttgactttt 2340 cctgcagtga gtgtaaaagt gaatggtcat gaccagggtc agaatccacc cctggacttg 2400 gagaccacag caagatttag agtggaaact cacacccaaa aaaccatagg cggaaatgtg 2460 acaaaagaaa agcccccatc tctgattgtt ccactggaaa gccagatgac aaaagaaaag 2520 aaaatcacag ggaaagaaaa agagaacagt agaatggagg aaaatgctga aaatcacata 2580 ggcgttactg aagtgttact tggaagaaag ctgcagcatt acacagatag ttacttgggc 2640 tttttgccat gggagaaaaa aaagtatttc ctagatcttc tcgacgaaga agagtcattg 2700 aagacacaat tggcctactt cactgatagc aagaatagag ccagatacaa gagagataca 2760 tttgcagatt ccctcagata tgtaaataaa attctaaata gcaagtttgg attcacatcg 2820 cggaaagtcc ctgctcacat gcctcacatg attgaccgga ttgttatgca agaactgcaa 2880 gatatgttcc ctgaagaatt tgacaagacg tcatttcaca aagtgcgcca ttctgaggat 2940 atgcagtttg ccttctctta tttttattat ctcatgagtg cagtgcagcc actgaatata 3000 tctcaagtct ttgatgaagt tgatacagat caatctggtg tcttgtctga cagagaaatc 3060 cgaacactgg ctaccagaat tcacgaactg ccgttaagtt tgcaggattt gacaggtctg 3120 gaacacatgc taataaattg ctcaaaaatg cttcctgctg atatcacgca gctaaataat 3180 attccaccaa ctcaggaatc ctactatgat cccaacctgc caccggtcac taaaagtcta 3240 gtaacaaact gtaaaccagt aactgacaaa atccacaaag catataagga caaaaacaaa 3300 tataggtttg aaatcatggg agaagaagaa atcgctttta aaatgattcg taccaacgtt 3360 tctcatgtgg ttggccagtt ggatgacata agaaaaaacc ctaggaagtt tgtttgcctg 3420 aatgacaaca ttgaccacaa tcataaagat gctcagacag tgaaggctgt tctcagggac 3480 ttctatgaat ccatgttccc cataccttcc caatttgaac tgccaagaga gtatcgaaac 3540 cgtttccttc atatgcatga gctgcaggaa tggagggctt atcgagacaa attgaagtag 3600 2 1199 PRT hybrid 2 Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 1 5 10 15 Gly Ser Thr Gly Asp Glu Asp Gln Val Asp Pro Arg Leu Ile Asp Gly 20 25 30 Lys Leu Ser Arg Asp Gln Tyr His Val Leu Phe Asp Ser Tyr Arg Asp 35 40 45 Asn Ile Ala Gly Lys Ser Phe Gln Asn Arg Leu Cys Leu Pro Met Pro 50 55 60 Ile Asp Val Val Tyr Thr Trp Val Asn Gly Thr Asp Leu Glu Leu Leu 65 70 75 80 Lys Glu Leu Gln Gln Val Arg Glu Gln Met Glu Glu Glu Gln Lys Ala 85 90 95 Met Arg Glu Ile Leu Gly Lys Asn Thr Thr Glu Pro Thr Lys Lys Ser 100 105 110 Glu Lys Gln Leu Glu Cys Leu Leu Thr His Cys Ile Lys Val Pro Met 115 120 125 Leu Val Leu Asp Pro Ala Leu Pro Ala Asn Ile Thr Leu Lys Asp Leu 130 135 140 Pro Ser Leu Tyr Pro Ser Phe His Ser Ala Ser Asp Ile Phe Asn Val 145 150 155 160 Ala Lys Pro Lys Asn Pro Ser Thr Asn Val Ser Val Val Val Phe Asp 165 170 175 Ser Thr Lys Asp Val Glu Asp Ala His Ser Gly Leu Leu Lys Gly Asn 180 185 190 Ser Arg Gln Thr Val Trp Arg Gly Tyr Leu Thr Thr Asp Lys Glu Val 195 200 205 Pro Gly Leu Val Leu Met Gln Asp Leu Ala Phe Leu Ser Gly Phe Pro 210 215 220 Pro Thr Phe Lys Glu Thr Asn Gln Leu Lys Thr Lys Leu Pro Glu Asn 225 230 235 240 Leu Ser Ser Lys Val Lys Leu Leu Gln Leu Tyr Ser Glu Ala Ser Val 245 250 255 Ala Leu Leu Lys Leu Asn Asn Pro Lys Asp Phe Gln Glu Leu Asn Lys 260 265 270 Gln Thr Lys Lys Asn Met Thr Ile Asp Gly Lys Glu Leu Thr Ile Ser 275 280 285 Pro Ala Tyr Leu Leu Trp Asp Leu Ser Ala Ile Ser Gln Ser Lys Gln 290 295 300 Asp Glu Asp Ile Ser Ala Ser Arg Phe Glu Asp Asn Glu Glu Leu Arg 305 310 315 320 Tyr Ser Leu Arg Ser Ile Glu Arg His Ala Pro Trp Val Arg Asn Ile 325 330 335 Phe Ile Val Thr Asn Gly Gln Ile Pro Ser Trp Leu Asn Leu Asp Asn 340 345 350 Pro Arg Val Thr Ile Val Thr His Gln Asp Val Phe Arg Asn Leu Ser 355 360 365 His Leu Pro Thr Phe Ser Ser Pro Ala Ile Glu Ser His Val His Arg 370 375 380 Ile Glu Gly Leu Ser Gln Lys Phe Ile Tyr Leu Asn Asp Asp Val Met 385 390 395 400 Phe Gly Lys Asp Val Trp Pro Asp Asp Phe Tyr Ser His Ser Lys Gly 405 410 415 Gln Lys Val Tyr Leu Thr Trp Pro Val Pro Asn Cys Ala Glu Gly Cys 420 425 430 Pro Gly Ser Trp Ile Lys Asp Gly Tyr Cys Asp Lys Ala Cys Asn Asn 435 440 445 Ser Ala Cys Asp Trp Asp Gly Gly Asp Cys Ser Gly Asn Ser Gly Gly 450 455 460 Ser Arg Tyr Ile Ala Gly Gly Gly Gly Thr Gly Ser Ile Gly Val Gly 465 470 475 480 Gln Pro Trp Gln Phe Gly Gly Gly Ile Asn Ser Val Ser Tyr Cys Asn 485 490 495 Gln Gly Cys Ala Asn Ser Trp Leu Ala Asp Lys Phe Cys Asp Gln Ala 500 505 510 Cys Asn Val Leu Ser Cys Gly Phe Asp Ala Gly Asp Cys Gly Gln Asp 515 520 525 His Phe His Glu Leu Tyr Lys Val Ile Leu Leu Pro Asn Gln Thr His 530 535 540 Tyr Ile Ile Pro Lys Gly Glu Cys Leu Pro Tyr Phe Ser Phe Ala Glu 545 550 555 560 Val Ala Lys Arg Gly Val Glu Gly Ala Tyr Ser Asp Asn Pro Ile Ile 565 570 575 Arg His Ala Ser Ile Ala Asn Lys Trp Lys Thr Ile His Leu Ile Met 580 585 590 His Ser Gly Met Asn Ala Thr Thr Ile His Phe Asn Leu Thr Phe Gln 595 600 605 Asn Thr Asn Asp Glu Glu Phe Lys Met Gln Ile Thr Val Glu Val Asp 610 615 620 Thr Arg Glu Gly Pro Lys Leu Asn Ser Thr Ala Gln Lys Gly Tyr Glu 625 630 635 640 Asn Leu Val Ser Pro Ile Thr Leu Leu Pro Glu Ala Glu Ile Leu Phe 645 650 655 Glu Asp Ile Pro Lys Glu Lys Arg Phe Pro Lys Phe Lys Arg His Asp 660 665 670 Val Asn Ser Thr Arg Arg Ala Gln Glu Glu Val Lys Ile Pro Leu Val 675 680 685 Asn Ile Ser Leu Leu Pro Lys Asp Ala Gln Leu Ser Leu Asn Thr Leu 690 695 700 Asp Leu Gln Leu Glu His Gly Asp Ile Thr Leu Lys Gly Tyr Asn Leu 705 710 715 720 Ser Lys Ser Ala Leu Leu Arg Ser Phe Leu Met Asn Ser Gln His Ala 725 730 735 Lys Ile Lys Asn Gln Ala Ile Ile Thr Asp Glu Thr Asn Asp Ser Leu 740 745 750 Val Ala Pro Gln Glu Lys Gln Val His Lys Ser Ile Leu Pro Asn Ser 755 760 765 Leu Gly Val Ser Glu Arg Leu Gln Arg Leu Thr Phe Pro Ala Val Ser 770 775 780 Val Lys Val Asn Gly His Asp Gln Gly Gln Asn Pro Pro Leu Asp Leu 785 790 795 800 Glu Thr Thr Ala Arg Phe Arg Val Glu Thr His Thr Gln Lys Thr Ile 805 810 815 Gly Gly Asn Val Thr Lys Glu Lys Pro Pro Ser Leu Ile Val Pro Leu 820 825 830 Glu Ser Gln Met Thr Lys Glu Lys Lys Ile Thr Gly Lys Glu Lys Glu 835 840 845 Asn Ser Arg Met Glu Glu Asn Ala Glu Asn His Ile Gly Val Thr Glu 850 855 860 Val Leu Leu Gly Arg Lys Leu Gln His Tyr Thr Asp Ser Tyr Leu Gly 865 870 875 880 Phe Leu Pro Trp Glu Lys Lys Lys Tyr Phe Leu Asp Leu Leu Asp Glu 885 890 895 Glu Glu Ser Leu Lys Thr Gln Leu Ala Tyr Phe Thr Asp Ser Lys Asn 900 905 910 Arg Ala Arg Tyr Lys Arg Asp Thr Phe Ala Asp Ser Leu Arg Tyr Val 915 920 925 Asn Lys Ile Leu Asn Ser Lys Phe Gly Phe Thr Ser Arg Lys Val Pro 930 935 940 Ala His Met Pro His Met Ile Asp Arg Ile Val Met Gln Glu Leu Gln 945 950 955 960 Asp Met Phe Pro Glu Glu Phe Asp Lys Thr Ser Phe His Lys Val Arg 965 970 975 His Ser Glu Asp Met Gln Phe Ala Phe Ser Tyr Phe Tyr Tyr Leu Met 980 985 990 Ser Ala Val Gln Pro Leu Asn Ile Ser Gln Val Phe Asp Glu Val Asp 995 1000 1005 Thr Asp Gln Ser Gly Val Leu Ser Asp Arg Glu Ile Arg Thr Leu 1010 1015 1020 Ala Thr Arg Ile His Glu Leu Pro Leu Ser Leu Gln Asp Leu Thr 1025 1030 1035 Gly Leu Glu His Met Leu Ile Asn Cys Ser Lys Met Leu Pro Ala 1040 1045 1050 Asp Ile Thr Gln Leu Asn Asn Ile Pro Pro Thr Gln Glu Ser Tyr 1055 1060 1065 Tyr Asp Pro Asn Leu Pro Pro Val Thr Lys Ser Leu Val Thr Asn 1070 1075 1080 Cys Lys Pro Val Thr Asp Lys Ile His Lys Ala Tyr Lys Asp Lys 1085 1090 1095 Asn Lys Tyr Arg Phe Glu Ile Met Gly Glu Glu Glu Ile Ala Phe 1100 1105 1110 Lys Met Ile Arg Thr Asn Val Ser His Val Val Gly Gln Leu Asp 1115 1120 1125 Asp Ile Arg Lys Asn Pro Arg Lys Phe Val Cys Leu Asn Asp Asn 1130 1135 1140 Ile Asp His Asn His Lys Asp Ala Gln Thr Val Lys Ala Val Leu 1145 1150 1155 Arg Asp Phe Tyr Glu Ser Met Phe Pro Ile Pro Ser Gln Phe Glu 1160 1165 1170 Leu Pro Arg Glu Tyr Arg Asn Arg Phe Leu His Met His Glu Leu 1175 1180 1185 Gln Glu Trp Arg Ala Tyr Arg Asp Lys Leu Lys 1190 1195 3 5597 DNA Homo sapiens 3 cggagccgag cgggcgtccg tcgccggagc tgcaatgagc ggcgcccgga ggctgtgacc 60 tgcgcgcggc ggcccgaccg gggcccctga atggcggctc gctgaggcgg cggcggcggc 120 ggcggctcag gctcctcggg gcgtggcgtg gcggtgaagg ggtgatgctg ttcaagctcc 180 tgcagagaca aacctatacc tgcctgtccc acaggtatgg gctctacgtg tgcttcttgg 240 gcgtcgttgt caccatcgtc tccgccttcc agttcggaga ggtggttctg gaatggagcc 300 gagatcaata ccatgttttg tttgattcct atagagacaa tattgctgga aagtcctttc 360 agaatcggct ttgtctgccc atgccgattg acgttgttta cacctgggtg aatggcacag 420 atcttgaact actgaaggaa ctacagcagg tcagagaaca gatggaggag gagcagaaag 480 caatgagaga aatccttggg aaaaacacaa cggaacctac taagaagagt gagaagcagt 540 tagagtgttt gctaacacac tgcattaagg tgccaatgct tgtactggac ccagccctgc 600 cagccaacat caccctgaag gacgtgccat ctctttatcc ttcttttcat tctgccagtg 660 acattttcaa tgttgcaaaa ccaaaaaacc cttctaccaa tgtctcagtt gttgtttttg 720 acagtactaa ggatgttgaa gatgcccact ctggactgct taaaggaaat agcagacaga 780 cagtatggag ggggtacttg acaacagata aagaagtccc tggattagtg ctaatgcaag 840 atttggcttt cctgagtgga tttccaccaa cattcaagga aacaaatcaa ctaaaaacaa 900 aattgccaga aaatctttcc tctaaagtca aactgttgca gttgtattca gaggccagtg 960 tagcgcttct aaaactgaat aaccccaagg attttcaaga attgaataag caaactaaga 1020 agaacatgac cattgatgga aaagaactga ccataagtcc tgcatattta ttatgggatc 1080 tgagcgccat cagccagtct aagcaggatg aagacatctc tgccagtcgt tttgaagata 1140 acgaagaact gaggtactca ttgcgatcta tcgagaggca tgcaccatgg gttcggaata 1200 ttttcattgt caccaacggg cagattccat cctggctgaa ccttgacaat cctcgagtga 1260 caatagtaac acaccaggat gtttttcgaa atttgagcca cttgcctacc tttagttcac 1320 ctgctattga aagtcacatt catcgcatcg aagggctgtc ccagaagttt atttacctaa 1380 atgatgatgt catgtttggg aaggatgtct ggccagatga tttttacagt cactccaaag 1440 gccagaaggt ttatttgaca tggcctgtgc caaactgtgc cgagggctgc ccaggttcct 1500 ggattaagga tggctattgt gacaaggctt gtaataattc agcctgcgat tgggatggtg 1560 gggattgctc tggaaacagt ggagggagtc gctatattgc aggaggtgga ggtactggga 1620 gtattggagt tggacacccc tggcagtttg gtggaggaat aaacagtgtc tcttactgta 1680 atcagggatg tgcgaattcc tggctcgctg ataagttctg tgaccaagca tgcaatgtct 1740 tgtcctgtgg gtttgatgct ggcgactgtg ggcaagatca ttttcatgaa ttgtataaag 1800 tgatccttct cccaaaccag actcactata ttattccaaa aggtgaatgc ctgccttatt 1860 tcagctttgc agaagtagcc aaaagaggag ttgaaggtgc ctatagtgac aatccaataa 1920 ttcgacatgc ttctattgcc aacaagtgga aaaccatcca cctcataatg cacagtggaa 1980 tgaatgccac cacaatacat tttaatctca cgtttcaaaa tacaaacgat gaagagttca 2040 aaatgcagat aacagtggag gtggacacaa gggagggacc aaaactgaat tctacggccc 2100 agaagggtta cgaaaattta gttagtccca taacacttct tccagaggcg gaaatccttt 2160 ttgaggatat tcccaaagaa aaacgcttcc cgaagtttaa gagacatgat gttaactcaa 2220 caaggagagc ccaggaagag gtgaaaattc ccctggtaaa tatttcactc cttccaaaag 2280 acgcccagtt gagtctcaat accttggatt tgcaactgga acatggagac atcactttga 2340 aaggatacaa tttgtccaag tcagccttgc tgagatcatt tctgatgaac tcacagcatg 2400 ctaaaataaa aaatcaagct ataataacag atgaaacaaa tgacagtttg gtggctccac 2460 aggaaaaaca ggttcataaa agcatcttgc caaacagctt aggagtgtct gaaagattgc 2520 agaggttgac ttttcctgca gtgagtgtaa aagtgaatgg tcatgaccag ggtcagaatc 2580 cacccctgga cttggagacc acagcaagat ttagagtgga aactcacacc caaaaaacca 2640 taggcggaaa tgtgacaaaa gaaaagcccc catctctgat tgttccactg gaaagccaga 2700 tgacaaaaga aaagaaaatc acagggaaag aaaaagagaa cagtagaatg gaggaaaatg 2760 ctgaaaatca cataggcgtt actgaagtgt tacttggaag aaagctgcag cattacacag 2820 atagttactt gggctttttg ccatgggaga aaaaaaagta tttccaagat cttctcgacg 2880 aagaagagtc attgaagaca caattggcat acttcactga tagcaaaaat actgggaggc 2940 aactaaaaga tacatttgca gattccctca gatatgtaaa taaaattcta aatagcaagt 3000 ttggattcac atcgcggaaa gtccctgctc acatgcctca catgattgac cggattgtta 3060 tgcaagaact gcaagatatg ttccctgaag aatttgacaa gacgtcattt cacaaagtgc 3120 gccattctga ggatatgcag tttgccttct cttattttta ttatctcatg agtgcagtgc 3180 agccactgaa tatatctcaa gtctttgatg aagttgatac agatcaatct ggtgtcttgt 3240 ctgacagaga aatccgaaca ctggctacca gaattcacga actgccgtta agtttgcagg 3300 atttgacagg tctggaacac atgctaataa attgctcaaa aatgcttcct gctgatatca 3360 cgcagctaaa taatattcca ccaactcagg aatcctacta tgatcccaac ctgccaccgg 3420 tcactaaaag tctagtaaca aactgtaaac cagtaactga caaaatccac aaagcatata 3480 aggacaaaaa caaatatagg tttgaaatca tgggagaaga agaaatcgct tttaaaatga 3540 ttcgtaccaa cgtttctcat gtggttggcc agttggatga cataagaaaa aaccctagga 3600 agtttgtttg cctgaatgac aacattgacc acaatcataa agatgctcag acagtgaagg 3660 ctgttctcag ggacttctat gaatccatgt tccccatacc ttcccaattt gaactgccaa 3720 gagagtatcg aaaccgtttc cttcatatgc atgagctgca ggaatggagg gcttatcgag 3780 acaaattgaa gttttggacc cattgtgtac tagcaacatt gattatgttt actatattct 3840 cattttttgc tgagcagtta attgcactta agcggaagat atttcccaga aggaggatac 3900 acaaagaagc tagtcccaat cgaatcagag tatagaagat cttcatttga aaaccatcta 3960 cctcagcatt tactgagcat tttaaaactc agcttcacag agatgtcttt gtgatgtgat 4020 gcttagcagt ttggcccgaa gaaggaaaat atccagtacc atgctgtttt gtggcatgaa 4080 tatagcccac tgactaggaa ttatttaacc aacccactga aaacttgtgt gtcgagcagc 4140 tctgaactga ttttactttt aaagaatttg ctcatggacc tgtcatcctt tttataaaaa 4200 ggctcactga caagagacag ctgttaattt cccacagcaa tcattgcaga ctaactttat 4260 taggagaagc ctatgccagc tgggagtgat tgctaagagg ctccagtctt tgcattccaa 4320 agccttttgc taaagttttg cacttttttt ttttcatttc ccatttttaa gtagttacta 4380 agttaactag ttattcttgc ttctgagtat aacgaattgg gatgtctaaa cctattttta 4440 tagatgttat ttaaataatg cagcaatatc acctcttatt gacaatacct aaattatgag 4500 ttttattaat atttaagact gtaaatggtc ttaaaccact aactactgaa gagctcaatg 4560 attgacatct gaaatgcttt gtaattattg acttcagccc ctaagaatgc tatgatttca 4620 cgtgcaggtc taatttcaac aggctagagt tagtactact taccagatgt aattatgttt 4680 tggaaatgta catattcaaa cagaagtgcc tcattttaga aatgagtagt gctgatggca 4740 ctggcacatt acagtggtgt cttgtttaat actcattggt atattccagt agctatctct 4800 ctcagttggt ttttgataga acagaggcca gcaaactttc tttgtaaaag gctggttagt 4860 aaattattgc aggccacctg tgtctttgtc atacattctt cttgctgttg tttagtttgt 4920 tttttttcaa acaaccctct aaaaatgtaa aaaccatgtt tagcttgcag ctgtacaaaa 4980 actgcccacc agccagatgt gaccctcagg ccatcatttg ccaatcactg agaattattt 5040 ttgttgttgt tgttgttgtt gtttttgaga cagagtctct ctctgttgcc caggctggag 5100 tgcagtggcg caatctcagc tcactgcaac ctccgcctcc cgggttcaag cagttctgtc 5160 tcagccttct gagtagctgg gactacaggt gcatgccacc acaccctgct aatttttgta 5220 tttttagtag agacgggggt tccaccatat tggtcaggct tatcttgaac tcctgacctc 5280 aggtgatcca cctgcctctg cctcccaaag tgctgagatt acaggcataa gccagtgcac 5340 ccagccgaga attagtattt ttatgtatgg ttaaaccttg gcgtctagcc atattttatg 5400 tcataataca atggatttgt gaagagcaga ttccatgagt aactctgaca ggtattttag 5460 atcatgatct caacaatatt cctcccaaat ggcatacatc ttttgtacaa agaacttgaa 5520 atgtaaatac tgtgtttgtg ctgtaagagt tgtgtatttc aaaaactgaa atctcataaa 5580 aagttaaatt ttgaaaa 5597 4 928 PRT Homo sapiens 4 Met Leu Phe Lys Leu Leu Gln Arg Gln Thr Tyr Thr Cys Leu Ser His 1 5 10 15 Arg Tyr Gly Leu Tyr Val Cys Phe Leu Gly Val Val Val Thr Ile Val 20 25 30 Ser Ala Phe Gln Phe Gly Glu Val Val Leu Glu Trp Ser Arg Asp Gln 35 40 45 Tyr His Val Leu Phe Asp Ser Tyr Arg Asp Asn Ile Ala Gly Lys Ser 50 55 60 Phe Gln Asn Arg Leu Cys Leu Pro Met Pro Ile Asp Val Val Tyr Thr 65 70 75 80 Trp Val Asn Gly Thr Asp Leu Glu Leu Leu Lys Glu Leu Gln Gln Val 85 90 95 Arg Glu Gln Met Glu Glu Glu Gln Lys Ala Met Arg Glu Ile Leu Gly 100 105 110 Lys Asn Thr Thr Glu Pro Thr Lys Lys Ser Glu Lys Gln Leu Glu Cys 115 120 125 Leu Leu Thr His Cys Ile Lys Val Pro Met Leu Val Leu Asp Pro Ala 130 135 140 Leu Pro Ala Asn Ile Thr Leu Lys Asp Val Pro Ser Leu Tyr Pro Ser 145 150 155 160 Phe His Ser Ala Ser Asp Ile Phe Asn Val Ala Lys Pro Lys Asn Pro 165 170 175 Ser Thr Asn Val Ser Val Val Val Phe Asp Ser Thr Lys Asp Val Glu 180 185 190 Asp Ala His Ser Gly Leu Leu Lys Gly Asn Ser Arg Gln Thr Val Trp 195 200 205 Arg Gly Tyr Leu Thr Thr Asp Lys Glu Val Pro Gly Leu Val Leu Met 210 215 220 Gln Asp Leu Ala Phe Leu Ser Gly Phe Pro Pro Thr Phe Lys Glu Thr 225 230 235 240 Asn Gln Leu Lys Thr Lys Leu Pro Glu Asn Leu Ser Ser Lys Val Lys 245 250 255 Leu Leu Gln Leu Tyr Ser Glu Ala Ser Val Ala Leu Leu Lys Leu Asn 260 265 270 Asn Pro Lys Asp Phe Gln Glu Leu Asn Lys Gln Thr Lys Lys Asn Met 275 280 285 Thr Ile Asp Gly Lys Glu Leu Thr Ile Ser Pro Ala Tyr Leu Leu Trp 290 295 300 Asp Leu Ser Ala Ile Ser Gln Ser Lys Gln Asp Glu Asp Ile Ser Ala 305 310 315 320 Ser Arg Phe Glu Asp Asn Glu Glu Leu Arg Tyr Ser Leu Arg Ser Ile 325 330 335 Glu Arg His Ala Pro Trp Val Arg Asn Ile Phe Ile Val Thr Asn Gly 340 345 350 Gln Ile Pro Ser Trp Leu Asn Leu Asp Asn Pro Arg Val Thr Ile Val 355 360 365 Thr His Gln Asp Val Phe Arg Asn Leu Ser His Leu Pro Thr Phe Ser 370 375 380 Ser Pro Ala Ile Glu Ser His Ile His Arg Ile Glu Gly Leu Ser Gln 385 390 395 400 Lys Phe Ile Tyr Leu Asn Asp Asp Val Met Phe Gly Lys Asp Val Trp 405 410 415 Pro Asp Asp Phe Tyr Ser His Ser Lys Gly Gln Lys Val Tyr Leu Thr 420 425 430 Trp Pro Val Pro Asn Cys Ala Glu Gly Cys Pro Gly Ser Trp Ile Lys 435 440 445 Asp Gly Tyr Cys Asp Lys Ala Cys Asn Asn Ser Ala Cys Asp Trp Asp 450 455 460 Gly Gly Asp Cys Ser Gly Asn Ser Gly Gly Ser Arg Tyr Ile Ala Gly 465 470 475 480 Gly Gly Gly Thr Gly Ser Ile Gly Val Gly His Pro Trp Gln Phe Gly 485 490 495 Gly Gly Ile Asn Ser Val Ser Tyr Cys Asn Gln Gly Cys Ala Asn Ser 500 505 510 Trp Leu Ala Asp Lys Phe Cys Asp Gln Ala Cys Asn Val Leu Ser Cys 515 520 525 Gly Phe Asp Ala Gly Asp Cys Gly Gln Asp His Phe His Glu Leu Tyr 530 535 540 Lys Val Ile Leu Leu Pro Asn Gln Thr His Tyr Ile Ile Pro Lys Gly 545 550 555 560 Glu Cys Leu Pro Tyr Phe Ser Phe Ala Glu Val Ala Lys Arg Gly Val 565 570 575 Glu Gly Ala Tyr Ser Asp Asn Pro Ile Ile Arg His Ala Ser Ile Ala 580 585 590 Asn Lys Trp Lys Thr Ile His Leu Ile Met His Ser Gly Met Asn Ala 595 600 605 Thr Thr Ile His Phe Asn Leu Thr Phe Gln Asn Thr Asn Asp Glu Glu 610 615 620 Phe Lys Met Gln Ile Thr Val Glu Val Asp Thr Arg Glu Gly Pro Lys 625 630 635 640 Leu Asn Ser Thr Ala Gln Lys Gly Tyr Glu Asn Leu Val Ser Pro Ile 645 650 655 Thr Leu Leu Pro Glu Ala Glu Ile Leu Phe Glu Asp Ile Pro Lys Glu 660 665 670 Lys Arg Phe Pro Lys Phe Lys Arg His Asp Val Asn Ser Thr Arg Arg 675 680 685 Ala Gln Glu Glu Val Lys Ile Pro Leu Val Asn Ile Ser Leu Leu Pro 690 695 700 Lys Asp Ala Gln Leu Ser Leu Asn Thr Leu Asp Leu Gln Leu Glu His 705 710 715 720 Gly Asp Ile Thr Leu Lys Gly Tyr Asn Leu Ser Lys Ser Ala Leu Leu 725 730 735 Arg Ser Phe Leu Met Asn Ser Gln His Ala Lys Ile Lys Asn Gln Ala 740 745 750 Ile Ile Thr Asp Glu Thr Asn Asp Ser Leu Val Ala Pro Gln Glu Lys 755 760 765 Gln Val His Lys Ser Ile Leu Pro Asn Ser Leu Gly Val Ser Glu Arg 770 775 780 Leu Gln Arg Leu Thr Phe Pro Ala Val Ser Val Lys Val Asn Gly His 785 790 795 800 Asp Gln Gly Gln Asn Pro Pro Leu Asp Leu Glu Thr Thr Ala Arg Phe 805 810 815 Arg Val Glu Thr His Thr Gln Lys Thr Ile Gly Gly Asn Val Thr Lys 820 825 830 Glu Lys Pro Pro Ser Leu Ile Val Pro Leu Glu Ser Gln Met Thr Lys 835 840 845 Glu Lys Lys Ile Thr Gly Lys Glu Lys Glu Asn Ser Arg Met Glu Glu 850 855 860 Asn Ala Glu Asn His Ile Gly Val Thr Glu Val Leu Leu Gly Arg Lys 865 870 875 880 Leu Gln His Tyr Thr Asp Ser Tyr Leu Gly Phe Leu Pro Trp Glu Lys 885 890 895 Lys Lys Tyr Phe Gln Asp Leu Leu Asp Glu Glu Glu Ser Leu Lys Thr 900 905 910 Gln Leu Ala Tyr Phe Thr Asp Ser Lys Asn Thr Gly Arg Gln Leu Lys 915 920 925 5 328 PRT Homo sapiens 5 Asp Thr Phe Ala Asp Ser Leu Arg Tyr Val Asn Lys Ile Leu Asn Ser 1 5 10 15 Lys Phe Gly Phe Thr Ser Arg Lys Val Pro Ala His Met Pro His Met 20 25 30 Ile Asp Arg Ile Val Met Gln Glu Leu Gln Asp Met Phe Pro Glu Glu 35 40 45 Phe Asp Lys Thr Ser Phe His Lys Val Arg His Ser Glu Asp Met Gln 50 55 60 Phe Ala Phe Ser Tyr Phe Tyr Tyr Leu Met Ser Ala Val Gln Pro Leu 65 70 75 80 Asn Ile Ser Gln Val Phe Asp Glu Val Asp Thr Asp Gln Ser Gly Val 85 90 95 Leu Ser Asp Arg Glu Ile Arg Thr Leu Ala Thr Arg Ile His Glu Leu 100 105 110 Pro Leu Ser Leu Gln Asp Leu Thr Gly Leu Glu His Met Leu Ile Asn 115 120 125 Cys Ser Lys Met Leu Pro Ala Asp Ile Thr Gln Leu Asn Asn Ile Pro 130 135 140 Pro Thr Gln Glu Ser Tyr Tyr Asp Pro Asn Leu Pro Pro Val Thr Lys 145 150 155 160 Ser Leu Val Thr Asn Cys Lys Pro Val Thr Asp Lys Ile His Lys Ala 165 170 175 Tyr Lys Asp Lys Asn Lys Tyr Arg Phe Glu Ile Met Gly Glu Glu Glu 180 185 190 Ile Ala Phe Lys Met Ile Arg Thr Asn Val Ser His Val Val Gly Gln 195 200 205 Leu Asp Asp Ile Arg Lys Asn Pro Arg Lys Phe Val Cys Leu Asn Asp 210 215 220 Asn Ile Asp His Asn His Lys Asp Ala Gln Thr Val Lys Ala Val Leu 225 230 235 240 Arg Asp Phe Tyr Glu Ser Met Phe Pro Ile Pro Ser Gln Phe Glu Leu 245 250 255 Pro Arg Glu Tyr Arg Asn Arg Phe Leu His Met His Glu Leu Gln Glu 260 265 270 Trp Arg Ala Tyr Arg Asp Lys Leu Lys Phe Trp Thr His Cys Val Leu 275 280 285 Ala Thr Leu Ile Met Phe Thr Ile Phe Ser Phe Phe Ala Glu Gln Leu 290 295 300 Ile Ala Leu Lys Arg Lys Ile Phe Pro Arg Arg Arg Ile His Lys Glu 305 310 315 320 Ala Ser Pro Asn Arg Ile Arg Val 325 6 1219 DNA Homo sapiens 6 gtagagcgca ggtgcgcggc tcgatggcgg cggggctggc gcggctcctg ttgctcctcg 60 ggctctcggc cggcgggccc gcgccggcag gtgcagcgaa gatgaaggtg gtggaggagc 120 ccaacgcgtt tggggtgaac aacccgttct tgcctcaggc cagtcgcctc caggccaaga 180 gggatccttc acccgtgtct ggacccgtgc atctcttccg actctcgggc aagtgcttca 240 gcctggtgga gtccacgtac aagtatgagt tctgcccgtt ccacaacgtg acccagcacg 300 agcagacctt ccgctggaac gcctacagtg ggatcctcgg catctggcac gagtgggaga 360 tcgccaacaa caccttcacg ggcatgtgga tgagggacgg tgacgcctgc cgttcccgga 420 gccggcagag caaggtggag ctggcgtgtg gaaaaagcaa ccggctggcc catgtgtccg 480 agccgagcac ctgcgtctat gcgctgacgt tcgagacccc cctcgtctgc cacccccacg 540 ccttgctagt gtacccaacc ctgccagagg ccctgcagcg gcagtgggac caggtagagc 600 aggacctggc cgatgagctg atcacccccc agggccatga gaagttgctg aggacacttt 660 ttgaggatgc tggctactta aagaccccag aagaaaatga acccacccag ctggagggag 720 gtcctgacag cttggggttt gagaccctgg aaaactgcag gaaggctcat aaagaactct 780 caaaggagat caaaaggctg aaaggtttgc tcacccagca cggcatcccc tacacgaggc 840 ccacagaaac ttccaacttg gagcacttgg gccacgagac gcccagagcc aagtctccag 900 agcagctgcg gggtgaccca ggactgcgtg ggagtttgtg accttgtggt gggagagcag 960 aggtggacgc ggccgagagc cctacagaga agctggctgg taggacccgc aggaccagct 1020 gaccaggctt gtgctcagag aagcagacaa aacaaagatt caaggtttta attaattccc 1080 atactgataa aaataactcc atgaattctg taaaccattg cataaatgct atagtgtaaa 1140 aaaatttaaa caagtgttaa ctttaaacag ttcgctacaa gtaaatgatt ataaatacta 1200 aaaaaaaaaa aaaaaaaaa 1219 7 305 PRT Homo sapiens 7 Met Ala Ala Gly Leu Ala Arg Leu Leu Leu Leu Leu Gly Leu Ser Ala 1 5 10 15 Gly Gly Pro Ala Pro Ala Gly Ala Ala Lys Met Lys Val Val Glu Glu 20 25 30 Pro Asn Ala Phe Gly Val Asn Asn Pro Phe Leu Pro Gln Ala Ser Arg 35 40 45 Leu Gln Ala Lys Arg Asp Pro Ser Pro Val Ser Gly Pro Val His Leu 50 55 60 Phe Arg Leu Ser Gly Lys Cys Phe Ser Leu Val Glu Ser Thr Tyr Lys 65 70 75 80 Tyr Glu Phe Cys Pro Phe His Asn Val Thr Gln His Glu Gln Thr Phe 85 90 95 Arg Trp Asn Ala Tyr Ser Gly Ile Leu Gly Ile Trp His Glu Trp Glu 100 105 110 Ile Ala Asn Asn Thr Phe Thr Gly Met Trp Met Arg Asp Gly Asp Ala 115 120 125 Cys Arg Ser Arg Ser Arg Gln Ser Lys Val Glu Leu Ala Cys Gly Lys 130 135 140 Ser Asn Arg Leu Ala His Val Ser Glu Pro Ser Thr Cys Val Tyr Ala 145 150 155 160 Leu Thr Phe Glu Thr Pro Leu Val Cys His Pro His Ala Leu Leu Val 165 170 175 Tyr Pro Thr Leu Pro Glu Ala Leu Gln Arg Gln Trp Asp Gln Val Glu 180 185 190 Gln Asp Leu Ala Asp Glu Leu Ile Thr Pro Gln Gly His Glu Lys Leu 195 200 205 Leu Arg Thr Leu Phe Glu Asp Ala Gly Tyr Leu Lys Thr Pro Glu Glu 210 215 220 Asn Glu Pro Thr Gln Leu Glu Gly Gly Pro Asp Ser Leu Gly Phe Glu 225 230 235 240 Thr Leu Glu Asn Cys Arg Lys Ala His Lys Glu Leu Ser Lys Glu Ile 245 250 255 Lys Arg Leu Lys Gly Leu Leu Thr Gln His Gly Ile Pro Tyr Thr Arg 260 265 270 Pro Thr Glu Thr Ser Asn Leu Glu His Leu Gly His Glu Thr Pro Arg 275 280 285 Ala Lys Ser Pro Glu Gln Leu Arg Gly Asp Pro Gly Leu Arg Gly Ser 290 295 300 Leu 305 8 5229 DNA Mus musculus 8 ggcggtgaag gggtgatgct gttcaagctc ctgcagagac agacctatac ctgcctatcc 60 cacaggtatg ggctctacgt ctgcttcgtg ggcgtcgttg tcaccatcgt ctcggctttc 120 cagttcggag aggtggttct ggaatggagc cgagatcagt accatgtttt gtttgattcc 180 tacagagaca acattgctgg gaaatccttt cagaatcggc tctgtctgcc catgccaatc 240 gacgtggttt acacctgggt gaatggcact gaccttgaac tgctaaagga gctacagcag 300 gtccgagagc acatggagga agagcagaga gccatgcggg aaaccctcgg gaagaacaca 360 accgaaccga caaagaagag tgagaagcag ctggaatgtc tgctgacgca ctgcattaag 420 gtgcccatgc ttgttctgga cccggccctg ccagccacca tcaccctgaa ggatctgcca 480 accctttacc catctttcca cgcgtccagc gacatgttca atgttgcgaa accaaaaaat 540 ccgtctacaa atgtccccgt tgtcgttttt gacactacta aggatgttga agacgcccat 600 gctggaccgt ttaagggagg ccagcaaaca gatgtttgga gagcctactt gacaacagac 660 aaagacgccc ctggcttagt gctgatacaa ggcttggcgt tcctgagtgg attcccaccg 720 accttcaagg agacgagtca actgaagaca aagctgccaa gaaaagcttt ccctctaaaa 780 ataaagctgt tgcggctgta ctcggaggcc agtgtcgctc ttctgaaatt gaataatccc 840 aagggtttcc aagagctgaa caagcagacc aagaagaaca tgaccatcga tgggaaggaa 900 ctgaccatca gccctgcgta tctgctgtgg gacctgagtg ccatcagcca gtccaagcag 960 gatgaggacg cgtctgccag ccgctttgag gataatgaag agctgaggta ctcgctgcga 1020 tctatcgaga gacacgcgcc atgggtacgg aatattttca ttgtcaccaa cgggcagatt 1080 ccatcctggc tgaaccttga caaccctcga gtgaccatag tgacccacca ggacattttc 1140 caaaatctga gccacttgcc tactttcagt tcccctgcta ttgaaagtca cattcaccgc 1200 atcgaagggc tgtcccagaa gtttatttat ctaaatgacg atgtcatgtt cggtaaggac 1260 gtctggccgg acgattttta cagccactcc aaaggtcaaa aggtttattt gacatggcct 1320 gtgccaaact gtgcagaggg ctgcccgggc tcctggataa aggacggcta ttgtgataag 1380 gcctgtaata cctcaccctg tgactgggat ggcggaaact gctctggtaa tactgcaggg 1440 aaccggtttg ttgcaagagg tgggggtacc gggaatattg gagctggaca gcactggcag 1500 tttggtggag gaataaacac catctcttac tgtaaccaag gatgtgcaaa ctcctggctg 1560 gctgacaagt tctgtgacca agcctgtaac gtcttatcct gcgggtttga tgctggtgac 1620 tgtggacaag atcattttca tgaattgtat aaagtaacac ttctcccaaa ccagactcac 1680 tatgttgtcc ccaaaggtga atacctgtct tatttcagct ttgcaaacat agccagaaaa 1740 agaattgaag ggacctacag cgacaacccc atcatccgcc acgcgtccat tgcaaacaag 1800 tggaaaaccc tacacctgat aatgcccggg gggatgaacg ccaccacgat ctattttaac 1860 ctcactcttc aaaacgccaa cgacgaagag ttcaagatcc agatagcagt agaggtggac 1920 acgagggagg cgcccaaact gaattctaca acccagaagg cctatgaaag tttggttagc 1980 ccagtgacac ctcttcctca ggctgacgtc ccttttgaag atgtccccaa agagaaacgc 2040 ttccccaaga tcaggagaca tgatgtaaat gcaacaggga gattccaaga ggaggtgaaa 2100 atcccccggg taaatatttc actccttccc aaagaggccc aggtgaggct gagcaacttg 2160 gatttgcaac tagaacgtgg agacatcact ctgaaaggat ataacttgtc caagtcagcc 2220 ctgctaaggt ctttcctggg gaattcacta gatactaaaa taaaacctca agctaggacc 2280 gatgaaacaa aaggcaacct ggaggtccca caggaaaacc cttctcacag acgtccacat 2340 ggctttgctg gtgaacacag atcagagaga tggactgccc cagcagagac agtgaccgtg 2400 aaaggccgtg accacgcttt gaatccaccc ccggtgttgg agaccaatgc aagattggcc 2460 cagcctacac taggcgtgac tgtgtccaaa gagaaccttt caccgctgat cgttccccca 2520 gaaagccact tgccaaaaga agaggagagt gacagggcag aaggcaatgc tgtacctgta 2580 aaggagttag tgcctggcag acggttgcag cagaattatc caggcttttt gccctgggag 2640 aaaaaaaagt atttccaaga ccttcttgat gaggaagagt cattgaagac ccagttggcg 2700 tactttacag accgcaaaca taccgggagg caactaaaag atacatttgc agactccctc 2760 cgatacgtca ataaaattct caacagcaag tttggattca catccaggaa agtccctgca 2820 cacatgccgc acatgattga caggatcgtt atgcaagaac tccaagatat gttccctgaa 2880 gaatttgaca agacttcatt tcacaaggtg cgtcactctg aggacatgca gtttgccttc 2940 tcctactttt attacctcat gagtgcagtt cagcccctca atatttccca agtctttcat 3000 gaagtagaca cagaccaatc tggtgtcttg tctgataggg aaatccgaac wctggccacg 3060 agaattcacg acctaccttt aagcttgcag gatttgacag gtttggaaca catgttaata 3120 aattgctcaa aaatgctccc cgctaatatc actcaactca acaacatccc accgactcag 3180 gaagcatact acgaccccaa cctgcctccg gtcactaaga gtcttgtcac caactgtaag 3240 ccagtaactg acaagatcca caaagcctat aaagacaaga acaaatacag gtttgaaatc 3300 atgggagagg aagaaatcgc tttcaagatg atacgaacca atgtttctca tgtggttggt 3360 cagttggatg acatcagaaa aaaccccagg aagttcgttt gtctgaatga caacattgac 3420 cacaaccata aagatgcccg gacagtgaag gctgtcctca gggacttcta tgagtccatg 3480 tttcccatac cttcccagtt tgagctgcca agagagtatc ggaaccgctt tctgcacatg 3540 catgagctcc aagaatggcg ggcatatcga gacaagctga agttttggac ccactgcgta 3600 ctagcaacgt tgattatatt tactatattc tcattttttg ctgaacagat aattgctctg 3660 aagcgaaaga tatttcccag gaggaggata cacaaagaag ctagtccaga ccgaatcagg 3720 gtgtagaaga tcttcatttg aaagtcacct accttagcat ctgtgaacat ctccctcctc 3780 gacaccacag cggagtccct gtgatgtggc acagaggcag cctcgtgggg agaagggaca 3840 tcgtgcagac cgggttcttc tgcaatggga agagagccca ctgacctgga attattcagc 3900 acactaagaa cctgtgtcaa tagcttgtac agcttgtact tttaaaggat ttgccgaagg 3960 acctgtcggc ttgttgacaa accctccctg acaagctgct ggtttcttcc cccagttact 4020 gcagactgag aaaccagtcc atcttgaaag caagtgcgga ggggccccag tctttgcatt 4080 ccaaagcttt ccagcataat ttctggcttg tctcctcctt tgatccattt cccatttttt 4140 tttaaaaaac aataagtggc tactaagtta gtcattctca cttctcaaaa taacaaatca 4200 ggatgtcaaa acatttgtat agatcttatt taaataatat agaacgatta cttctttagc 4260 ctatctaaat tattgatttt tattaacagt caagtggtct tgaaccgcta acaactactg 4320 aagagctcga gattgacgtt gaaagtgctt tgagcttgtt taactcattc cccaagaata 4380 ctgtgacctc gtgtgcgggc ctgattgcga agggctagtg tcacgtagca gtgctgctca 4440 ccggatgtaa ttatgtcgtg gaaatgtaca tacagacaaa agtgcctcac ttcagaaatg 4500 agtagtgctg atggcaccag cgagtgatgg tgtccatttg gaaacccatg ataccttcca 4560 atgcccaccc tgcttacttt atacagagca ggggttaacc aacttctgtc aaagaacagt 4620 aaagaacttg agatacatcc atctttgtca aatagttttc cttgctaaca tttattattg 4680 ttggtgtttt gggaggttta ttttatttta ttgctttgtt atttttcaag acggggattc 4740 tctgtgtagc tctggctgtt tggtaattca ctctaaagac caggctggcc ttgaacttag 4800 agattcacct gcttctgctt cctgaatggt aggacatgtg cccacattgc ctacccaccc 4860 cccttttggg gggggtgagc aactcaataa aaagatgaaa acctgcttta gtttgcagct 4920 atacaaaagc agcaggcctc agccagactt gacccccggg gccattgttg gcccacggga 4980 gaatcatttt tgacgtgggt aagcaaaccc tgatattggt catgctgtgt tatgtcatta 5040 tgtggtggtt ttgaattttg gaagatattt tcagtcatga tttcagtagt attcctccaa 5100 aatggcacac atttttgtaa taagaacttg aaatgtaaat attgtgtttg tgctgtaaat 5160 tttgtgtatt tcaaaaactg aagtttcata aaaaaacaca cttattggaa aaaaaaaaaa 5220 aaaaaaaaa 5229 9 908 PRT Mus musculus 9 Met Leu Phe Lys Leu Leu Gln Arg Gln Thr Tyr Thr Cys Leu Ser His 1 5 10 15 Arg Tyr Gly Leu Tyr Val Cys Phe Val Gly Val Val Val Thr Ile Val 20 25 30 Ser Ala Phe Gln Phe Gly Glu Val Val Leu Glu Trp Ser Arg Asp Gln 35 40 45 Tyr His Val Leu Phe Asp Ser Tyr Arg Asp Asn Ile Ala Gly Lys Ser 50 55 60 Phe Gln Asn Arg Leu Cys Leu Pro Met Pro Ile Asp Val Val Tyr Thr 65 70 75 80 Trp Val Asn Gly Thr Asp Leu Glu Leu Leu Lys Glu Leu Gln Gln Val 85 90 95 Arg Glu His Met Glu Glu Glu Gln Arg Ala Met Arg Glu Thr Leu Gly 100 105 110 Lys Asn Thr Thr Glu Pro Thr Lys Lys Ser Glu Lys Gln Leu Glu Cys 115 120 125 Leu Leu Thr His Cys Ile Lys Val Pro Met Leu Val Leu Asp Pro Ala 130 135 140 Leu Pro Ala Thr Ile Thr Leu Lys Asp Leu Pro Thr Leu Tyr Pro Ser 145 150 155 160 Phe His Ala Ser Ser Asp Met Phe Asn Val Ala Lys Pro Lys Asn Pro 165 170 175 Ser Thr Asn Val Pro Val Val Val Phe Asp Thr Thr Lys Asp Val Glu 180 185 190 Asp Ala His Ala Gly Pro Phe Lys Gly Gly Gln Gln Thr Asp Val Trp 195 200 205 Arg Ala Tyr Leu Thr Thr Asp Lys Asp Ala Pro Gly Leu Val Leu Ile 210 215 220 Gln Gly Leu Ala Phe Leu Ser Gly Phe Pro Pro Thr Phe Lys Glu Thr 225 230 235 240 Ser Gln Leu Lys Thr Lys Leu Pro Arg Lys Ala Phe Pro Leu Lys Ile 245 250 255 Lys Leu Leu Arg Leu Tyr Ser Glu Ala Ser Val Ala Leu Leu Lys Leu 260 265 270 Asn Asn Pro Lys Gly Phe Gln Glu Leu Asn Lys Gln Thr Lys Lys Asn 275 280 285 Met Thr Ile Asp Gly Lys Glu Leu Thr Ile Ser Pro Ala Tyr Leu Leu 290 295 300 Trp Asp Leu Ser Ala Ile Ser Gln Ser Lys Gln Asp Glu Asp Ala Ser 305 310 315 320 Ala Ser Arg Phe Glu Asp Asn Glu Glu Leu Arg Tyr Ser Leu Arg Ser 325 330 335 Ile Glu Arg His Ala Pro Trp Val Arg Asn Ile Phe Ile Val Thr Asn 340 345 350 Gly Gln Ile Pro Ser Trp Leu Asn Leu Asp Asn Pro Arg Val Thr Ile 355 360 365 Val Thr His Gln Asp Ile Phe Gln Asn Leu Ser His Leu Pro Thr Phe 370 375 380 Ser Ser Pro Ala Ile Glu Ser His Ile His Arg Ile Glu Gly Leu Ser 385 390 395 400 Gln Lys Phe Ile Tyr Leu Asn Asp Asp Val Met Phe Gly Lys Asp Val 405 410 415 Trp Pro Asp Asp Phe Tyr Ser His Ser Lys Gly Gln Lys Val Tyr Leu 420 425 430 Thr Trp Pro Val Pro Asn Cys Ala Glu Gly Cys Pro Gly Ser Trp Ile 435 440 445 Lys Asp Gly Tyr Cys Asp Lys Ala Cys Asn Thr Ser Pro Cys Asp Trp 450 455 460 Asp Gly Gly Asn Cys Ser Gly Asn Thr Ala Gly Asn Arg Phe Val Ala 465 470 475 480 Arg Gly Gly Gly Thr Gly Asn Ile Gly Ala Gly Gln His Trp Gln Phe 485 490 495 Gly Gly Gly Ile Asn Thr Ile Ser Tyr Cys Asn Gln Gly Cys Ala Asn 500 505 510 Ser Trp Leu Ala Asp Lys Phe Cys Asp Gln Ala Cys Asn Val Leu Ser 515 520 525 Cys Gly Phe Asp Ala Gly Asp Cys Gly Gln Asp His Phe His Glu Leu 530 535 540 Tyr Lys Val Thr Leu Leu Pro Asn Gln Thr His Tyr Val Val Pro Lys 545 550 555 560 Gly Glu Tyr Leu Ser Tyr Phe Ser Phe Ala Asn Ile Ala Arg Lys Arg 565 570 575 Ile Glu Gly Thr Tyr Ser Asp Asn Pro Ile Ile Arg His Ala Ser Ile 580 585 590 Ala Asn Lys Trp Lys Thr Leu His Leu Ile Met Pro Gly Gly Met Asn 595 600 605 Ala Thr Thr Ile Tyr Phe Asn Leu Thr Leu Gln Asn Ala Asn Asp Glu 610 615 620 Glu Phe Lys Ile Gln Ile Ala Val Glu Val Asp Thr Arg Glu Ala Pro 625 630 635 640 Lys Leu Asn Ser Thr Thr Gln Lys Ala Tyr Glu Ser Leu Val Ser Pro 645 650 655 Val Thr Pro Leu Pro Gln Ala Asp Val Pro Phe Glu Asp Val Pro Lys 660 665 670 Glu Lys Arg Phe Pro Lys Ile Arg Arg His Asp Val Asn Ala Thr Gly 675 680 685 Arg Phe Gln Glu Glu Val Lys Ile Pro Arg Val Asn Ile Ser Leu Leu 690 695 700 Pro Lys Glu Ala Gln Val Arg Leu Ser Asn Leu Asp Leu Gln Leu Glu 705 710 715 720 Arg Gly Asp Ile Thr Leu Lys Gly Tyr Asn Leu Ser Lys Ser Ala Leu 725 730 735 Leu Arg Ser Phe Leu Gly Asn Ser Leu Asp Thr Lys Ile Lys Pro Gln 740 745 750 Ala Arg Thr Asp Glu Thr Lys Gly Asn Leu Glu Val Pro Gln Glu Asn 755 760 765 Pro Ser His Arg Arg Pro His Gly Phe Ala Gly Glu His Arg Ser Glu 770 775 780 Arg Trp Thr Ala Pro Ala Glu Thr Val Thr Val Lys Gly Arg Asp His 785 790 795 800 Ala Leu Asn Pro Pro Pro Val Leu Glu Thr Asn Ala Arg Leu Ala Gln 805 810 815 Pro Thr Leu Gly Val Thr Val Ser Lys Glu Asn Leu Ser Pro Leu Ile 820 825 830 Val Pro Pro Glu Ser His Leu Pro Lys Glu Glu Glu Ser Asp Arg Ala 835 840 845 Glu Gly Asn Ala Val Pro Val Lys Glu Leu Val Pro Gly Arg Arg Leu 850 855 860 Gln Gln Asn Tyr Pro Gly Phe Leu Pro Trp Glu Lys Lys Lys Tyr Phe 865 870 875 880 Gln Asp Leu Leu Asp Glu Glu Glu Ser Leu Lys Thr Gln Leu Ala Tyr 885 890 895 Phe Thr Asp Arg Lys His Thr Gly Arg Gln Leu Lys 900 905 10 328 PRT Mus musculus 10 Asp Thr Phe Ala Asp Ser Leu Arg Tyr Val Asn Lys Ile Leu Asn Ser 1 5 10 15 Lys Phe Gly Phe Thr Ser Arg Lys Val Pro Ala His Met Pro His Met 20 25 30 Ile Asp Arg Ile Val Met Gln Glu Leu Gln Asp Met Phe Pro Glu Glu 35 40 45 Phe Asp Lys Thr Ser Phe His Lys Val Arg His Ser Glu Asp Met Gln 50 55 60 Phe Ala Phe Ser Tyr Phe Tyr Tyr Leu Met Ser Ala Val Gln Pro Leu 65 70 75 80 Asn Ile Ser Gln Val Phe His Glu Val Asp Thr Asp Gln Ser Gly Val 85 90 95 Leu Ser Asp Arg Glu Ile Arg Thr Leu Ala Thr Arg Ile His Asp Leu 100 105 110 Pro Leu Ser Leu Gln Asp Leu Thr Gly Leu Glu His Met Leu Ile Asn 115 120 125 Cys Ser Lys Met Leu Pro Ala Asn Ile Thr Gln Leu Asn Asn Ile Pro 130 135 140 Pro Thr Gln Glu Ala Tyr Tyr Asp Pro Asn Leu Pro Pro Val Thr Lys 145 150 155 160 Ser Leu Val Thr Asn Cys Lys Pro Val Thr Asp Lys Ile His Lys Ala 165 170 175 Tyr Lys Asp Lys Asn Lys Tyr Arg Phe Glu Ile Met Gly Glu Glu Glu 180 185 190 Ile Ala Phe Lys Met Ile Arg Thr Asn Val Ser His Val Val Gly Gln 195 200 205 Leu Asp Asp Ile Arg Lys Asn Pro Arg Lys Phe Val Cys Leu Asn Asp 210 215 220 Asn Ile Asp His Asn His Lys Asp Ala Arg Thr Val Lys Ala Val Leu 225 230 235 240 Arg Asp Phe Tyr Glu Ser Met Phe Pro Ile Pro Ser Gln Phe Glu Leu 245 250 255 Pro Arg Glu Tyr Arg Asn Arg Phe Leu His Met His Glu Leu Gln Glu 260 265 270 Trp Arg Ala Tyr Arg Asp Lys Leu Lys Phe Trp Thr His Cys Val Leu 275 280 285 Ala Thr Leu Ile Ile Phe Thr Ile Phe Ser Phe Phe Ala Glu Gln Ile 290 295 300 Ile Ala Leu Lys Arg Lys Ile Phe Pro Arg Arg Arg Ile His Lys Glu 305 310 315 320 Ala Ser Pro Asp Arg Ile Arg Val 325 11 2070 DNA Mus musculus misc_feature (186)..(186) n is a, t, c, or g 11 gtgagaccct aggagcaatg gccgggcggc tggctggctt cctgatgttg ctggggctcg 60 cgtcgcaggg gcccgcgccg gcatgtgccg ggaagatgaa ggtggtggag gagcctaaca 120 cattcgggtg agcggatcac ggtcctgcgg cttggggacc gagcctggct ggttcttctg 180 accttntcaa ttccataggc tgaataaccc gttcttgccc caggcaagcc gccttcagcc 240 caagagagag ccttcagctg tatcccgcaa attaagagaa attaatttca aacgatttag 300 aaagtattct agccaggcga tgatggcgca cgcctttaat cccagcactt gggaggcaga 360 ggcaggcaga tttccgagtt caaggccatc agaactgact gtacatctta gtacagttta 420 gcatgtgatc agagatctga atcacaaagc tgggcctgcg tggtaaagca ggtcctttct 480 aataaggttg cagtttagat tttctttctt aactctttta ttctttgaga cagggtttct 540 caacagtggg tgtcctggaa ctcacttttg taaaccaggc tgcccttaaa ctcacaaagc 600 tctgtcagcc tctgcctcct gagtgctggg attaaaggtc cacaccctgt tcattcattt 660 ttaatttttg agactgggtc tcattatgtg gccctagaca gatactgaga gcctcctcca 720 caggaacaag catgggaatc ctgccacaga caaccagttc tgtggtctgg agatgagttt 780 gtcagtccct aggagttagg tcagcctgcc tctgcattcc caataattta ggaaaggagc 840 ttggggcgtt ctggccttga tggttagtgc cctcctgcca accttagctt ccagctttag 900 gggtagcaga gtttataccg atgctaaact gctgttgtgt tcttccccag ggcccctgca 960 tctcttcaga cttgctggca agtgctttag cctagtggag tccacgtgag tgccaggctg 1020 gtgggtggag tgggcggagt ctgcagagct cctgatgtgc ctgtgtttcc caggtacaag 1080 tatgaattct gccctttcca caacgtcacc cagcacgagc agaccttccg ctggaatgcc 1140 tacagcggga tccttggcat ctggcatgag tgggaaatca tcaacaatac cttcaagggc 1200 atgtggatga ctgatgggga ctcctgccac tcccggagcc ggcagagcaa ggtggagctc 1260 acctgtggaa agatcaaccg actggcccac gtgtctgagc caagcacctg tgtctatgca 1320 ttgacattcg agacccctct tgtttgccat ccccactctt tgttagtgta tccaactctg 1380 tcagaagccc tgcagcagcc cttggaccag gtggaacagg acctggcaga tgaactgatc 1440 acaccacagg gctatgagaa gttgctaagg gtactttttg aggatgctgg ctacttaaag 1500 gtcccaggag aaacccatcc cacccagctg gcaggaggtt ccaagggcct ggggcttgag 1560 actctggaca actgtagaaa ggcacatgca gagctgtcac aggaggtaca aagactgacg 1620 agtctgctgc aacagcatgg aatcccccac actcagccca caggtcagtc tgcctgccct 1680 ggtcagctgc cagccactcc ggggcctgca gcactggggc agatctttat tgctacccat 1740 tctggcagaa accactcact ctcagcacct gggtcagcag ctccccatag gtgcaatcgc 1800 agcagagcat ctgcggagtg acccaggact acgtgggaac atcctgtgag caaggtggcc 1860 acgaagaata gaaatatcct gagctttgag tgtcctttca cagagtgaac aaaactggtg 1920 tggtgtagac acggcttctt ttggcatatt ctagatcaga cagtgtcact gacaaacaag 1980 agggacctgc tggccagcct ttgttgtgcc caaagatcca gacaaaataa agattcaaag 2040 ttttaattaa aaaaaaaaaa aaaggaattc 2070 12 307 PRT Mus musculus 12 Met Ala Gly Arg Leu Ala Gly Phe Leu Met Leu Leu Gly Leu Ala Ser 1 5 10 15 Gln Gly Pro Ala Pro Ala Cys Ala Gly Lys Met Lys Val Val Glu Glu 20 25 30 Pro Asn Thr Phe Gly Leu Asn Asn Pro Phe Leu Pro Gln Ala Ser Arg 35 40 45 Leu Gln Pro Lys Arg Glu Pro Ser Ala Val Ser Gly Pro Leu His Leu 50 55 60 Phe Arg Leu Ala Gly Lys Cys Phe Ser Leu Val Glu Ser Thr Tyr Lys 65 70 75 80 Tyr Glu Phe Cys Pro Phe His Asn Val Thr Gln His Glu Gln Thr Phe 85 90 95 Arg Trp Asn Ala Tyr Ser Gly Ile Leu Gly Ile Trp His Glu Trp Glu 100 105 110 Ile Ile Asn Asn Thr Phe Lys Gly Met Trp Met Thr Asp Gly Asp Ser 115 120 125 Cys His Ser Arg Ser Arg Gln Ser Lys Val Glu Leu Thr Cys Gly Lys 130 135 140 Ile Asn Arg Leu Ala His Val Ser Glu Pro Ser Thr Cys Val Tyr Ala 145 150 155 160 Leu Thr Phe Glu Thr Pro Leu Val Cys His Pro His Ser Leu Leu Val 165 170 175 Tyr Pro Thr Leu Ser Glu Ala Leu Gln Gln Arg Leu Asp Gln Val Glu 180 185 190 Gln Asp Leu Ala Asp Glu Leu Ile Thr Pro Gln Gly Tyr Glu Lys Leu 195 200 205 Leu Arg Val Leu Phe Glu Asp Ala Gly Tyr Leu Lys Val Pro Gly Glu 210 215 220 Thr His Pro Thr Gln Leu Ala Gly Gly Ser Lys Gly Leu Gly Leu Glu 225 230 235 240 Thr Leu Asp Asn Cys Arg Lys Ala His Ala Glu Leu Ser Gln Glu Val 245 250 255 Gln Arg Leu Thr Ser Leu Leu Gln Gln His Gly Ile Pro His Thr Gln 260 265 270 Pro Thr Glu Thr Thr His Ser Gln His Leu Gly Gln Gln Leu Pro Ile 275 280 285 Gly Ala Ile Ala Ala Glu His Leu Arg Ser Asp Pro Gly Leu Arg Gly 290 295 300 Asn Ile Leu 305 13 460 DNA Rattus rattus 13 attcccacca acattcaagg agacgagtca gctgaagaca aaactgccag aaaatctttc 60 ttctaaaata aaactgttgc agctgtactc ggaggccagc gtcgctcttc tgaaattgaa 120 taaccccaaa ggtttccccg agctgaacaa gcagaccaag aagaacatga gcatcagtgg 180 gaaggaactg gccatcagcc ctgcctatct gctgtgggac ctgagcgcca tcagccagtc 240 caagcaggat gaagatgtgt ctgccagccg cttcgaggat aacgaagagc tgaggtactc 300 actgagatct atcgagagac atgattccat gagtccttta tgaattctgg ccatatcttc 360 aatcatgatc tcagtagtat tcctctgaaa tggcacacat ttttctaatg agaacttgaa 420 atgtaaatat tgtgtttgtg ctgtaaattt tgtgtatttc 460 14 113 PRT Rattus rattus 14 Phe Pro Pro Thr Phe Lys Glu Thr Ser Gln Leu Lys Thr Lys Leu Pro 1 5 10 15 Glu Asn Leu Ser Ser Lys Ile Lys Leu Leu Gln Leu Tyr Ser Glu Ala 20 25 30 Ser Val Ala Leu Leu Lys Leu Asn Asn Pro Lys Gly Phe Pro Glu Leu 35 40 45 Asn Lys Gln Thr Lys Lys Asn Met Ser Ile Ser Gly Lys Glu Leu Ala 50 55 60 Ile Ser Pro Ala Tyr Leu Leu Trp Asp Leu Ser Ala Ile Ser Gln Ser 65 70 75 80 Lys Gln Asp Glu Asp Val Ser Ala Ser Arg Phe Glu Asp Asn Glu Glu 85 90 95 Leu Arg Tyr Ser Leu Arg Ser Ile Glu Arg His Asp Ser Met Ser Pro 100 105 110 Leu 15 1105 DNA Drosophila melanogaster misc_feature (903)..(903) n is a, g, t, or c 15 ctgcaggaat tcggcacgag gcggttcgat gacaagaatg agctgcggta ctctctgagg 60 tccctggaaa aacacgccgc atggatcagg catgtgtaca tagtaaccaa tggccagatt 120 ccaagttggc tggatctcag ctacgaaagg gtcacggtgg tgccccacga agtcctggct 180 cccgatcccg accagctgcc caccttctcc agctcggcca tcgagacatt tctgcaccgc 240 ataccaaagc tgtccaagag gttcctctac ctcaacgacg acatattcct gggagctccg 300 ctgtatccgg aggacttgta cactgaagcg gagggagttc gcgtgtacca ggcatggatg 360 gtgcccggct gcgccttgga ttgcccctgg acgtacatag gtgatggagc ttgcgatcgg 420 cactgcaaca ttgatgcgtg ccaatttgat ggaggcgact gcagtgaaac tgggccagcg 480 agcgatgccc acgtcattcc accaagcaaa gaagtgctcg aggtgcagcc tgccgctgtt 540 ccacaatcaa gagtccaccg atttcctcag atgggtctcc aaaagctgtt caggcgcagc 600 tctgccaatt ttaaggatgt tatgcggcac cgcaatgtgt ccacactcaa ggaactacgt 660 cgcattgtgg agcgttttaa caaggccaaa ctcatgtcgc tgaaccccga actggagacc 720 tccagctccg agccacagac aactcagcgc cacgggctgc gcaaggagga ttttaagtct 780 tccaccgata tttactctca ctcgctgatt gccaccaata tgttgctgaa tagagcctat 840 ggctttaagg cacgccatgt cctggcgcac gtgggcttcc taattgacaa ggatattgtg 900 gangccatgc aacgacgttt taccagcgaa ttctngacac tggccattaa cgctttccga 960 gccccaacag atttgcagta cgcattcgct tactacttct ttctaatgag cgaaatccaa 1020 gtnatgagtg tagangaaat cttcgatgaa gtcgacaccg gacggtttgg ncacctggtc 1080 ggatccagaa gtgcgaaccn tttta 1105 16 502 PRT Drosophila melanogaster 16 Gly Thr Arg Arg Phe Asp Asp Lys Asn Glu Leu Arg Tyr Ser Leu Arg 1 5 10 15 Ser Leu Glu Lys His Ala Ala Trp Ile Arg His Val Tyr Ile Val Thr 20 25 30 Asn Gly Gln Ile Pro Ser Trp Leu Asp Leu Ser Tyr Glu Arg Val Thr 35 40 45 Val Val Pro His Glu Val Leu Ala Pro Asp Pro Asp Gln Leu Pro Thr 50 55 60 Phe Ser Ser Ser Ala Ile Glu Thr Phe Leu His Arg Ile Pro Lys Leu 65 70 75 80 Ser Lys Arg Phe Leu Tyr Leu Asn Asp Asp Ile Phe Leu Gly Ala Pro 85 90 95 Leu Tyr Pro Glu Asp Leu Tyr Thr Glu Ala Glu Gly Val Arg Val Tyr 100 105 110 Gln Ala Trp Met Val Pro Gly Cys Ala Leu Asp Cys Pro Trp Thr Tyr 115 120 125 Ile Gly Asp Gly Ala Cys Asp Arg His Cys Asn Ile Asp Ala Cys Gln 130 135 140 Phe Asp Gly Gly Asp Cys Ser Glu Thr Gly Pro Ala Ser Asp Ala His 145 150 155 160 Val Ile Pro Pro Ser Lys Glu Val Leu Glu Val Gln Pro Ala Ala Val 165 170 175 Pro Gln Ser Arg Val His Arg Phe Pro Gln Met Gly Leu Gln Lys Leu 180 185 190 Phe Arg Arg Ser Ser Ala Asn Phe Lys Asp Val Met Arg His Arg Asn 195 200 205 Val Ser Thr Leu Lys Glu Leu Arg Arg Ile Val Glu Arg Phe Asn Lys 210 215 220 Ala Lys Leu Met Ser Leu Asn Pro Glu Leu Glu Thr Ser Ser Ser Glu 225 230 235 240 Pro Gln Thr Thr Gln Arg His Gly Leu Arg Lys Glu Asp Phe Lys Ser 245 250 255 Ser Thr Asp Ile Tyr Ser His Ser Leu Ile Ala Thr Asn Met Leu Leu 260 265 270 Asn Arg Ala Tyr Gly Phe Lys Ala Arg His Val Leu Ala His Val Gly 275 280 285 Phe Leu Ile Asp Lys Asp Ile Val Glu Ala Met Gln Arg Arg Phe His 290 295 300 Gln Gln Ile Leu Asp Thr Ala His Gln Arg Phe Arg Ala Pro Thr Asp 305 310 315 320 Leu Gln Tyr Ala Phe Ala Tyr Tyr Ser Phe Leu Met Ser Glu Thr Lys 325 330 335 Val Met Ser Val Glu Glu Ile Phe Asp Glu Phe Asp Thr Asp Gly Ser 340 345 350 Ala Thr Trp Ser Asp Arg Glu Val Arg Thr Phe Leu Thr Arg Ile Tyr 355 360 365 Gln Pro Pro Leu Asp Trp Ser Ala Met Arg Tyr Phe Glu Glu Val Val 370 375 380 Gln Asn Cys Thr Arg Asn Leu Gly Met His Leu Lys Val Asp Thr Val 385 390 395 400 Glu His Ser Thr Leu Val Tyr Glu Arg Tyr Glu Asp Ser Asn Leu Pro 405 410 415 Thr Ile Thr Arg Asp Leu Val Val Arg Cys Pro Leu Leu Ala Glu Ala 420 425 430 Leu Ala Ala Asn Phe Ala Val Arg Pro Lys Tyr Asn Phe His Val Ser 435 440 445 Pro Lys Arg Thr Ser His Ser Asn Phe Met Met Leu Thr Ser Asn Leu 450 455 460 Thr Glu Val Val Glu Ser Leu Asp Arg Leu Arg Arg Asn Pro Arg Lys 465 470 475 480 Phe Asn Cys Ile Asn Asp Asn Leu Asp Ala Asn Arg Gly Glu Asp Asn 485 490 495 Glu Asp Gly Ala Pro Ser 500 17 2183 DNA Homo sapiens 17 atggcgacct ccacgggtcg ctggcttctc ctccggcttg cactattcgg cttcctctgg 60 gaagcgtccg gcggcctcga ctcgggggcc tcccgcgacg acgacttgct actgccctat 120 ccacgcgcgc gcgcgcgcct cccccgggac tgcacacggg tgcgcgccgg caaccgcgag 180 cacgagagtt ggcctccgcc tcccgcgact cccggcgccg gcggtctggc cgtgcgcacc 240 ttcgtgtcgc acttcaggga ccgcgcggtg gccggccacc tgacgcgggc cgttgagccc 300 ctgcgcacct tctcggtgct ggagcccggt ggacccggcg gctgcgcggc gagacgacgc 360 gccaccgtgg aggagacggc gcgggcggcc gactgccgtg tcgcccagaa cggcggcttc 420 ttccgcatga actcgggcga gtgcctgggg aacgtggtga gcgacgagcg gcgggtgagc 480 agctccgggg ggctgcagaa cgcgcagttc gggatccgcc gcgacgggac cctggtcacc 540 gggtacctgt ctgaggagga ggtgctggac actgagaacc catttgtgca gctgctgagt 600 ggggtcgtgt ggctgattcg taatggaagc atctacatca acgagagcca agccacagag 660 tgtgacgaga cacaggagac aggttccttt agcaaatttg tgaatgtgat atcagccagg 720 acggccattg gccacgaccg gaaagggcag ctggtgctct ttcatgcaga cggccatacg 780 gagcagcgtg gcatcaacct gtgggaaatg gcggagttcc tgctgaaaca ggacgtggtc 840 aacgccatca acctggatgg gggtggctct gccacctttg tgctcaacgg gaccttggcc 900 agttacccgt cagatcactg ccaggacaac atgtggcgct gtccccgcca agtgtccacc 960 gtggtgtgtg tgcacgaacc ccgctgccag ccgcctgact gccacggcca cgggacctgc 1020 gtggacgggc actgccaatg caccgggcac ttctggcggg gtcccggctg tgatgagctg 1080 gactgtggcc cctctaactg cagccagcac ggactgtgca cggagaccgg ctgccgctgt 1140 gatgccggat ggaccgggtc caactgcagt gaagagtgtc cccttggctg gcatgggccg 1200 ggctgccaga ggcgttgtaa gtgtgagcac cattgtccct gtgaccccaa gactggcaac 1260 tgcagcgtct ccagagtaaa gcagtgtctc cagccacctg aagccaccct gagggcggga 1320 gaactctcct ttttcaccag gaccgcctgg ctagccctca ccctggcgct ggccttcctc 1380 ctgctgatca gcattgcagc aaacctgtcc ttgctcctgt ccagagcaga gaggaaccgg 1440 cgcctgcatg gggactatgc ataccacccg ctgcaggaga tgaacgggga gcctctggcc 1500 gcagagaagg agcagccagg gggcgcccac aaccccttca aggactgaag cctcaagctg 1560 cccggggtgg cacgtcgcga aagcttgttt ccccacggtc tggcttctgc aggggaaatt 1620 tcaaggccac tggcgtggac catctgggtg tcctcaatgg cccctgtggg gcagccaagt 1680 tcctgatagc acttgtgcct cagcccctca cctggccacc tgccagggca cctgcaaccc 1740 tagcaatacc atgctcgctg gagaggctca gctgcctgct tctcgcctgc ctgtgtctgc 1800 tgccgagaag cccgtgcccc cgggagggct gccgcactgc caaagagtct ccctcctcct 1860 ggggaagggg ctgccaacga accagactca gtgaccacgt catgacagaa cagcacatcc 1920 tggccagcac ccctggctgg agtgggttaa agggacgagt ctgccttcct ggctgtgaca 1980 cgggacccct tttctacaga cctcatcact ggatttgcca actagaattc gatttcctgt 2040 cataggaagc tccttggaag aagggatggg gggatgaaat catgtttaca gacctgtttt 2100 gtcatcctgc tgccaagaag ttttttaatc acttgaataa attgatataa taaaaggagc 2160 caccaggtgg tgtgtggatt ctg 2183 18 515 PRT Homo sapiens 18 Met Ala Thr Ser Thr Gly Arg Trp Leu Leu Leu Arg Leu Ala Leu Phe 1 5 10 15 Gly Phe Leu Trp Glu Ala Ser Gly Gly Leu Asp Ser Gly Ala Ser Arg 20 25 30 Asp Asp Asp Leu Leu Leu Pro Tyr Pro Arg Ala Arg Ala Arg Leu Pro 35 40 45 Arg Asp Cys Thr Arg Val Arg Ala Gly Asn Arg Glu His Glu Ser Trp 50 55 60 Pro Pro Pro Pro Ala Thr Pro Gly Ala Gly Gly Leu Ala Val Arg Thr 65 70 75 80 Phe Val Ser His Phe Arg Asp Arg Ala Val Ala Gly His Leu Thr Arg 85 90 95 Ala Val Glu Pro Leu Arg Thr Phe Ser Val Leu Glu Pro Gly Gly Pro 100 105 110 Gly Gly Cys Ala Ala Arg Arg Arg Ala Thr Val Glu Glu Thr Ala Arg 115 120 125 Ala Ala Asp Cys Arg Val Ala Gln Asn Gly Gly Phe Phe Arg Met Asn 130 135 140 Ser Gly Glu Cys Leu Gly Asn Val Val Ser Asp Glu Arg Arg Val Ser 145 150 155 160 Ser Ser Gly Gly Leu Gln Asn Ala Gln Phe Gly Ile Arg Arg Asp Gly 165 170 175 Thr Leu Val Thr Gly Tyr Leu Ser Glu Glu Glu Val Leu Asp Thr Glu 180 185 190 Asn Pro Phe Val Gln Leu Leu Ser Gly Val Val Trp Leu Ile Arg Asn 195 200 205 Gly Ser Ile Tyr Ile Asn Glu Ser Gln Ala Thr Glu Cys Asp Glu Thr 210 215 220 Gln Glu Thr Gly Ser Phe Ser Lys Phe Val Asn Val Ile Ser Ala Arg 225 230 235 240 Thr Ala Ile Gly His Asp Arg Lys Gly Gln Leu Val Leu Phe His Ala 245 250 255 Asp Gly His Thr Glu Gln Arg Gly Ile Asn Leu Trp Glu Met Ala Glu 260 265 270 Phe Leu Leu Lys Gln Asp Val Val Asn Ala Ile Asn Leu Asp Gly Gly 275 280 285 Gly Ser Ala Thr Phe Val Leu Asn Gly Thr Leu Ala Ser Tyr Pro Ser 290 295 300 Asp His Cys Gln Asp Asn Met Trp Arg Cys Pro Arg Gln Val Ser Thr 305 310 315 320 Val Val Cys Val His Glu Pro Arg Cys Gln Pro Pro Asp Cys His Gly 325 330 335 His Gly Thr Cys Val Asp Gly His Cys Gln Cys Thr Gly His Phe Trp 340 345 350 Arg Gly Pro Gly Cys Asp Glu Leu Asp Cys Gly Pro Ser Asn Cys Ser 355 360 365 Gln His Gly Leu Cys Thr Glu Thr Gly Cys Arg Cys Asp Ala Gly Trp 370 375 380 Thr Gly Ser Asn Cys Ser Glu Glu Cys Pro Leu Gly Trp His Gly Pro 385 390 395 400 Gly Cys Gln Arg Arg Cys Lys Cys Glu His His Cys Pro Cys Asp Pro 405 410 415 Lys Thr Gly Asn Cys Ser Val Ser Arg Val Lys Gln Cys Leu Gln Pro 420 425 430 Pro Glu Ala Thr Leu Arg Ala Gly Glu Leu Ser Phe Phe Thr Arg Thr 435 440 445 Ala Trp Leu Ala Leu Thr Leu Ala Leu Ala Phe Leu Leu Leu Ile Ser 450 455 460 Ile Ala Ala Asn Leu Ser Leu Leu Leu Ser Arg Ala Glu Arg Asn Arg 465 470 475 480 Arg Leu His Gly Asp Tyr Ala Tyr His Pro Leu Gln Glu Met Asn Gly 485 490 495 Glu Pro Leu Ala Ala Glu Lys Glu Gln Pro Gly Gly Ala His Asn Pro 500 505 510 Phe Lys Asp 515 19 2005 DNA Mus musculus 19 gtttcccgcg acgatgacct gctgctgcct tacccactag cgcgcagacg tccctcgcga 60 gactgcgccc gggtgcgctc aggtagccca gagcaggaga gctggcctcc gccacctctg 120 gccacccacg aaccccgggc gccaagccac cacgcggccg tgcgcacctt cgtgtcgcac 180 ttcgaggggc gcgcggtggc cggccacctg acgcgggtcg ccgatcccct acgcactttc 240 tcggtgctgg agcccggagg agccgggggc tgcggcggca gaagcgccgc ggctactgtg 300 gaggacacag ccgtccgggc cggttgccgc atcgctcaga acggtggctt cttccgcatg 360 agcactggcg agtgcttggg gaacgtggtg agcgacgggc ggctggtgag cagctcaggg 420 ggactgcaga acgcgcagtt cggtatccga cgcgatggaa ccatagtcac cgggtcctgt 480 cttgaagaag aggttctgga tcccgtgaat ccgttcgtgc agctgctgag cggagtcgtg 540 tggctcatcc gcaatggaaa catctacatc aacgagagcc aagccatcga gtgtgacgag 600 acacaggaga caggttcttt tagcaaattt gtgaatgtga tgtcagccag gacagccgtg 660 ggtcatgacc gtgaggggca gcttatcctc ttccatgctg atggacagac ggaacagcgt 720 ggccttaacc tatgggagat ggcagagttc ctgcgtcaac aagatgtcgt caatgccatc 780 aacctggatg gaggcggttc tgctactttt gtgctcaatg ggaccctggc cagttaccct 840 tcagatcact gccaggacaa catgtggcgc tgtccccgcc aagtgtccac tgtggtgtgt 900 gtgcatgaac cgcgctgcca gccacccgac tgcagtggcc atgggacctg tgtggatggc 960 cactgtgaat gcaccagcca cttctggcgg ggcgaggcct gcagcgagct ggactgtggc 1020 ccctccaact gcagccagca tgggctgtgc acagctggct gccactgtga tgctgggtgg 1080 acaggatcca actgcagtga agagtgtcct ctgggctggt atgggccagg ttgccagagg 1140 ccctgccagt gtgagcacca gtgtttctgt gacccgcaga ctggcaactg cagcatctcc 1200 caagtgaggc agtgtctcca gccaactgag gctacgccga gggcaggaga gctggcctct 1260 ttcaccagga ccacctggct agccctcacc ctgacactaa ttttcctgct gctgatcagc 1320 actggggtca acgtgtcctt gttcctgggc tccagggccg agaggaaccg gcacctcgac 1380 ggggactatg tgtatcaccc actgcaggag gtgaacgggg aagcgctgac tgcagagaag 1440 gagcacatgg aggaaactag caaccccttc aaggactgaa gagctgcccc aacggcatgc 1500 tccagataat cttgtccctg ctcctcactt ccacagggga cattgtgagg ccactggcat 1560 ggatgctatg caccccaccc tttgctggcc atattcctcc tgtccccatg ctgtggctca 1620 tgccaaccta gcaataagga gctctggaga gcctgcacct gcctcccgct cgcctatatc 1680 tgctgcccag aggcctgtct cgcacagggg tctcgccact gccaaagact cccaggaagt 1740 caaagactcc cagtaatcca ctagcaaatg gaactctgta acgccatcat aacaagagtg 1800 gccactctcc gcgtgcacag gtatgaaata taaatcctta cacacacaca cacacacacc 1860 ctcggctcag ccacggcact cgccttttat acagcgtcat cgctggacag ccaactagaa 1920 ctctgcatcc tgtcacagga agcacctcat aagaaggaat ggggagggaa ggcagtcgcc 1980 ttgttttcag accttagccg aattc 2005 20 492 PRT Mus musculus 20 Val Ser Arg Asp Asp Asp Leu Leu Leu Pro Tyr Pro Leu Ala Arg Arg 1 5 10 15 Arg Pro Ser Arg Asp Cys Ala Arg Val Arg Ser Gly Ser Pro Glu Gln 20 25 30 Glu Ser Trp Pro Pro Pro Pro Leu Ala Thr His Glu Pro Arg Ala Pro 35 40 45 Ser His His Ala Ala Val Arg Thr Phe Val Ser His Phe Glu Gly Arg 50 55 60 Ala Val Ala Gly His Leu Thr Arg Val Ala Asp Pro Leu Arg Thr Phe 65 70 75 80 Ser Val Leu Glu Pro Gly Gly Ala Gly Gly Cys Gly Gly Arg Ser Ala 85 90 95 Ala Ala Thr Val Glu Asp Thr Ala Val Arg Ala Gly Cys Arg Ile Ala 100 105 110 Gln Asn Gly Gly Phe Phe Arg Met Ser Thr Gly Glu Cys Leu Gly Asn 115 120 125 Val Val Ser Asp Gly Arg Leu Val Ser Ser Ser Gly Gly Leu Gln Asn 130 135 140 Ala Gln Phe Gly Ile Arg Arg Asp Gly Thr Ile Val Thr Gly Ser Cys 145 150 155 160 Leu Glu Glu Glu Val Leu Asp Pro Val Asn Pro Phe Val Gln Leu Leu 165 170 175 Ser Gly Val Val Trp Leu Ile Arg Asn Gly Asn Ile Tyr Ile Asn Glu 180 185 190 Ser Gln Ala Ile Glu Cys Asp Glu Thr Gln Glu Thr Gly Ser Phe Ser 195 200 205 Lys Phe Val Asn Val Met Ser Ala Arg Thr Ala Val Gly His Asp Arg 210 215 220 Glu Gly Gln Leu Ile Leu Phe His Ala Asp Gly Gln Thr Glu Gln Arg 225 230 235 240 Gly Leu Asn Leu Trp Glu Met Ala Glu Phe Leu Arg Gln Gln Asp Val 245 250 255 Val Asn Ala Ile Asn Leu Asp Gly Gly Gly Ser Ala Thr Phe Val Leu 260 265 270 Asn Gly Thr Leu Ala Ser Tyr Pro Ser Asp His Cys Gln Asp Asn Met 275 280 285 Trp Arg Cys Pro Arg Gln Val Ser Thr Val Val Cys Val His Glu Pro 290 295 300 Arg Cys Gln Pro Pro Asp Cys Ser Gly His Gly Thr Cys Val Asp Gly 305 310 315 320 His Cys Glu Cys Thr Ser His Phe Trp Arg Gly Glu Ala Cys Ser Glu 325 330 335 Leu Asp Cys Gly Pro Ser Asn Cys Ser Gln His Gly Leu Cys Thr Ala 340 345 350 Gly Cys His Cys Asp Ala Gly Trp Thr Gly Ser Asn Cys Ser Glu Glu 355 360 365 Cys Pro Leu Gly Trp Tyr Gly Pro Gly Cys Gln Arg Pro Cys Gln Cys 370 375 380 Glu His Gln Cys Phe Cys Asp Pro Gln Thr Gly Asn Cys Ser Ile Ser 385 390 395 400 Gln Val Arg Gln Cys Leu Gln Pro Thr Glu Ala Thr Pro Arg Ala Gly 405 410 415 Glu Leu Ala Ser Phe Thr Arg Thr Thr Trp Leu Ala Leu Thr Leu Thr 420 425 430 Leu Ile Phe Leu Leu Leu Ile Ser Thr Gly Val Asn Val Ser Leu Phe 435 440 445 Leu Gly Ser Arg Ala Glu Arg Asn Arg His Leu Asp Gly Asp Tyr Val 450 455 460 Tyr His Pro Leu Gln Glu Val Asn Gly Glu Ala Leu Thr Ala Glu Lys 465 470 475 480 Glu His Met Glu Glu Thr Ser Asn Pro Phe Lys Asp 485 490 21 9792 DNA Mus musculus 21 caggctcggg acttactata acacaggaca cttgtcacct gaaagcttga gtcagtcagt 60 tattatggtc tgtgtgtgag atacaagtgg gtgcataggc agtggtgcac acatgtagat 120 cagactttct acagccaatt ctcttcttcc tcctctccat gggttcaggg tcttcatctc 180 aggttgcaca gcgagttcat ttatgtgctg tgccatctcg ccagtcgttc ctatatccta 240 gaggaaaact agtttcttct ggtcaagagg aggaaagagt ggagacctgt cattctaaga 300 tacccaaaac agggccaggt tggggacctg tgcctttaat cccatcactt ggggattagg 360 tagaagcaag aggctctaga ccagtctaca cactgaattt caagccagcc tacctataaa 420 tcagagaccc tgcttcaaaa ataaaattaa acaaaaacga agataaacca agctacccaa 480 aacacaagag ttaatccagt cagacaggtc tagcaaatgc taggatgaaa ggtgtgcacc 540 accacgagtg ggctgcaagc ctctctctct ctctctctct ctctctctct ctcgtttgtt 600 ttgtttttcg agacaaggtt tctctgtgta gccctggctg tcctggaact cactctgtag 660 accaggctgg cctcgagctt cactcttaaa agttcctctt cctcctcctc catcttttcc 720 tcctcttacc ccctaggctc cttttcctct tcttgtcttt cagataaagt ctcaagtagt 780 ccagactggt ctcaaactaa ctaactagcc aagaatagcc aacctcttaa cttccgattc 840 tcctgcctct gctgaatgct ggggttgtgg cgtgggccac cacttctggt ttgtgcaaca 900 cagaaggaac tagggcttta agcacgagaa gcaagttctg tacagactta cacaggccca 960 gcatctgttc ttgcaatttt ctgtaagttt gacataatat gagaataaaa agctatctat 1020 ctcccttcca gccttaccct ctctgatgga attcgaatgc gtaatcaaag cacccaacag 1080 cctggcctga aatcacgtgg ggcaagccca cgtgaccgga gcaccaatcc aatatggcgg 1140 cgcccagggg gcccgggctg ttcctcatac ccgcgctgct cggcttactc ggggtggcgt 1200 ggtgcagctt aagcttcggg tgagtgcaag ccgccggggc cagcctggct ggggtccacc 1260 tttcctgagc gctctcaggc acagccctcc gacctcacga tcgccccgtc cctgcagggt 1320 ttcccgcgac gatgacctgc tgctgcctta cccactagcg cgcagacgtc cctcgcgaga 1380 ctgcgcccgg gtgcgctcag gtagcccaga gcaggagagc tggcctccgc cacctctggc 1440 cacccacgaa ccccgggcgc caagccacca cgcggccgtg cgcaccttcg tgtcgcactt 1500 cgaggggcgc gcggtggccg gccacctgac gcgggtcgcc gatcccctac gcactttctc 1560 ggtgctggag cccggaggag ccgggggctg cggcggcaga agcgccgcgg ctactgtgga 1620 ggacacagcc gtccgggccg gttgccgcat cgctcagaac ggtggcttct tccgcatgag 1680 cactggcgag tgcttgggga acgtggtgag cgacgggcgg ctggtgagca gctcaggggg 1740 actgcagaac gcgcagttcg gtatccgacg cgatggaacc atagtcaccg ggtgaggagg 1800 cagggagccc cggggctgta gagggcaaag ggtctctgat gttctttcag agccatgcct 1860 ccgagtccag gtccctaacc aaacttcctg tctttcttct tccgagtaat gacgctgaca 1920 ccttccttcc tttaagttta ttcatgtgcc actgaataat ctgtgatcag gccgtgtgtg 1980 gggacttggg gaggcgaccg tgagcctgaa cacagtttgt gccctagtga actttgtgta 2040 gtattagaga aacatttcgt gttcaacgaa gccatggaac caattggaaa tagtgtagag 2100 tttatggagc agtcccagac agctagctgg aggccttttg ctgtcctgat aaaaatccag 2160 gttagacaag gagcttgttg agggcagcct ttggaagttt ctgtgtttct tgaaatttga 2220 cagcagccag agttgacagc aggcaggcag gagtagaagg tagcgccatc tggtgttcca 2280 gttctcttcc aaggttccgt tttttgccaa ggctgggaag tgggctttcc ccaactcttc 2340 tcagcccttg gttgcaattt ctgggcctgc ccatgtatct ggttcttcat ccttcaacat 2400 cagccagtgt caccactgtt gatcttaggt tttcacagat cctaaaactt ctgccagtga 2460 ccagcgcctg cagtttctct tccctggctc tgtccttcaa cctctctaca ttccagccat 2520 ctccctagct cctctcttgg actccctttc agacttgttg tcatgatcac tgtctcagaa 2580 cccctattgc tcctttacaa tggtccactg acctgctcac ctcctacttt ttttttttaa 2640 atgtgtgtgc atctgtgtgt gcctgagggg agaccagagt ttgatttcaa atgtcttcta 2700 ttctcttttc ctccatctta ttttctaaca caaaatctga atctagagat cactggttca 2760 gttaacctgg ctggccggta aaccccaggg ccctcctgct tccctctgtc caccccaccc 2820 cagcactaag gctacagtgt gtgctgttcc agccagcttt ctcatgggtg ctgaggatct 2880 gaacgcaggt tcacatgtgt ggtgggaagg cttttaccca atgctctgtc tttccagccc 2940 atcctccctt gttaactgcc aaacagctgc ctatcctgtc catgtgtagc tcactgctac 3000 ttcttttatt atgaggtcag cacatgttac taaagatggc aagagaagaa ggttctttca 3060 ttgtgtcata gctatagctc aggaggaatt ttatttcctg tgtaggcaca caggagagca 3120 tcttccagct cacactccaa ctgaactaac tgaacacctg cctatatatc caaagaaggg 3180 gtgtcagtgc caatcacagc acacctccag tgcaaatgaa ggtttgtgtt tgcaccaatc 3240 acagccttgc ctcttttagc atgcatcaca acaaagtcct cctagactat caggggatat 3300 gctctcttgg ccaaggtagg aatagttgca gtgtcatctg gcacaaacca tttcaaacgg 3360 cctggctgag gttatgcctt cgggaacctg aagtctttgt gtggttgtct ccaagtgtct 3420 gtggagctcc aggcggctgg tgctgacaga cgctttgtct agttggctgt ttgacttttg 3480 cttaagcagc cagggcagta gagtctaaca gatgctaatt tcaggatcag gaagactgta 3540 gaaaaatgag catcaagaag cccctggtac ccaaagctgc tcttgccaat gagtgaacct 3600 ctgccttccc gcttccaggt cctgtcttga agaagaggtt ctggatcccg tgaatccgtt 3660 cgtgcagctg ctgagcggag tcgtgtggct catccgcaat ggaaacatct acatcaacga 3720 gagccaagcc atcgagtgtg acgagacaca ggagacaggt caggaagcac aggtgttctg 3780 ttttatttgt attaggtttt gatttgttta ttttgtgcat gcagcgggtg catgcatgct 3840 cctttccttt cgccatgtga gtcctgagta ttgaactcag actgttaagt gtgatgggag 3900 gcactttacc cactgagcca ctttcccagc cctcagcatc agctttcttc agacccagga 3960 acagtgtgag tgggttattc tttagtgttc ccaaacattt actgagcagc tatttactgt 4020 ttagcactat ggtgagagtc ctagggattc agtcttatgt agaatataga aggagaatcc 4080 ttggcaataa gctggaaaat tgtgacaagt gccaagaaag aaacaggaga aaggggaccg 4140 gtggggacca gaagcacagg tatgaggaaa gtgcctgcag atttgctgta tggtggcctc 4200 cacatggcct aggagtttgt cataaatgca gagccatgag tccaccctcc ctatacctcc 4260 catccagaaa ccactggtta aatcctaaca acttgggtgt gcaggcactc ccttggtgac 4320 tctgatggac actcaaggtc aagggccact tggggatggg ctgatgagtt ggcttggtca 4380 gtaaagtatt tgccttgaaa gtgtgaggac ctgagttgga gccccagaaa gaaacattaa 4440 aagccaagtg ctgggatgca cacttgcatt cccagggatg gagctggaag gcagggatag 4500 gcagatccac ggccacacgg tgatattcta agctaacaag agacctgtct cacacagaaa 4560 gtgggtggca cctgaggacc aacacccagg gttatcctct gacgtacctc cagagtggaa 4620 aatactgggg tggtggaaaa ggacactttg gtcctgggaa tctggctatt cagggtatag 4680 tgtagaggga gagggagact caagaggctg tctttgagtc aaaggaacaa gctatcagaa 4740 gaactcaggg cagaggcctg tggttcccag gctcagggca gccttcaagg ccctaggcag 4800 agagtagctg ctgggtgaac aagtacagaa gtgaggcctg gggcctcagg caaggcctgt 4860 gaaatccttc caccaacata gaagtttctg gagactgaga tcacatgaag tgcttctggc 4920 tgtggcatgg aagctcactg gaggtggagc tgggatgtgg ctcagtgatc cagtgcttgc 4980 cacacgtgca cgagggaagg agccatcaaa agagagaaag tcgggagacc tgaggggtcc 5040 cctggagagc tgggtaacca ccccgggccc ttctccttta ggttctttta gcaaatttgt 5100 gaatgtgatg tcagccagga cagccgtggg tcatgaccgt gaggggcagc ttatcctctt 5160 ccatgctgat ggacagacgg aacagcgtgg tgagtcccag gaaccttggg gctgtttgca 5220 cttcagccac cctacctttc cagtcggttc tggggtattg gtgggacaag acagctttcc 5280 ggccattttg gaagtttcat ctggaggcaa tagcatttac ctactagtga aagaagccag 5340 ttaagccaga gaccacaggg gctcaagctg cataccccct ctgcacagcc ttaacctatg 5400 ggagatggca gagttcctgc gtcaacaaga tgtcgtcaat gccatcaacc tggatggagg 5460 cggttctgct acttttgtgc tcaatgggac cctggccagt tacccttcag atcactggta 5520 agaacccttg agccaccttt gtggctctct cagactgtct cactcagtca atactgagac 5580 cctgttgtgt gccaggccct gggtatccaa aagtgagcag aagagccgag atctcttccc 5640 tcagggtgct gcacagccca tccctggaaa cctgagacag gtcaggaaag gcctccctga 5700 ggacagtgaa gtaagacctg aggagatggc tggccggggt tgagagagcc tttaccggaa 5760 gacaaactgt acgcaatggg gaaatccgct aagtggccca gggagaggct ggagctatag 5820 ctcaggagga aaagtacttg cctcgcaagc gaaggacctg agtttaaact ccaaaaccca 5880 tataaaaagc cagatacgag caagtggcac atgcttgcag tcccagcctt gttgaggaag 5940 agtcaggtga atcctgaccc tctggccagc cagcctagcc tactttttgg caaggtccag 6000 gccagcgaga aagataaata aaataaagtt ttaaatgaca tgtatctaag gttgtcctga 6060 ctccatatgc gcacgcacgc atgcacgcac gcacaactgg cagaatggaa agggaggcaa 6120 actggacagc ctttataggc tgcggcaggg accagcacca aggcctagac ctcgtctcac 6180 agtgaatccc ccacagccag gacaacatgt ggcgctgtcc ccgccaagtg tccactgtgg 6240 tgtgtgtgca tgaaccgcgc tgccagccac ccgactgcag tggccatggg acctgtgtgg 6300 atggccactg tgaatgcacc agccacttct ggcggggcga ggcctgcagc gagctggact 6360 gtggcccctc caactgcagc cagcatgggc tgtgcacaga gagtgagtgg ggagcccaca 6420 ggagggtggt gctctggcgg gaccccagct cgcccatgct agactcccgc ctgtgtcctt 6480 acccagcctc tgtggtcttg ctttggtagc tggctgccac tgtgatgctg ggtggacagg 6540 atccaactgc agtgaaggtg agagctgcct gcaaacactc ctggagaggg tggcctggct 6600 gcacgcagct ggtatgacgc cttcgtccct ccttctggct tggaacttac cttcagagcc 6660 ttttctcatt tcgcatgtgg atacccgatg ttctacctac tgaaagagcc cacaagtagg 6720 aagccagatt ttcagtattg tcactcaact ctaaggacca atagcaaaaa aacaaagtgg 6780 ccacgcccct gagggagatc caccaaagtc cttaactcct ggaaagcagc tcctggtgat 6840 cctaggcatg ggtagggtgg tttcagcatc agctcagtgg agttcccatt cataatttct 6900 tcatcctttt aaggtcataa gttctagagc ccaccttaaa tctaggcagt attcttggtg 6960 tttatctgag acaaagtctt atacagccca cgcagttctc taacttagta tgtaaccgag 7020 aatggcctca agcaacctgc ttcctccttt caagcgctgg gattataggc atagcaccaa 7080 cttatagggt gctagaagtc aaacccaggg ccctatgtat atgcagcaag cactctagaa 7140 actggaacac agccctgttt gcagcccggt taccttggag ggttgggtcc cagggatctg 7200 agggcatctc cttcagcatg gccatgtgca cacccaggag ccaggctgtc tgtgacagga 7260 gaccatgcca cccaaggtga gacctccctg ccaccatctc ctctccacag agtgtcctct 7320 gggctggtat gggccaggtt gccagaggcc ctgccagtgt gagcaccagt gtttctgtga 7380 cccgcagact ggcaactgca gcatctccca aggtatgcgg ccttaaaggt tcttgagctg 7440 ggagcccttg gggcaggtct ggggtaggtg gactctcccc agcccttctt tctggtgtct 7500 tgcagtgagg cagtgtctcc agccaactga ggctacgccg agggcaggag agctggcctc 7560 tttcaccagg taagtgtttt agcaggcact gagcccctat gtctcatccg tgaggcacta 7620 gccaggccag gaggtcacag gttaccctct actttgcaag ctcagggaca gtcacaggta 7680 aaactggcat ccaggaaaga ccctgagcta cccagtggaa ctcaaaggta gcaggctatg 7740 ggtgtcatgc ctctggctgc agagactcca cttagatgct ggagcagggc catagagaca 7800 ggaaggactc accttatttc tgaactcttc cgtgtgttca ggctttgtgt tgttgttgct 7860 tcctttctgc tgtttcctgg gtttccagct ccatccccac agggctcatg gaaagaattg 7920 tgaagcaggg ggtgtggctc aattggcaga ttgattgcct ggcatgcaga aagccctagg 7980 ttcaatcccc agcatttcat atcataaccc aggcatggtg gcatcatgtg cctgtaagtc 8040 cagcacttgg gaggtagaag cagaaaagcc acgagtttaa gaatgttagg gagtcttagg 8100 ccaacctggg atacctaaga caagagatag atgtagggag atagattgac agacagacag 8160 acagacagac agacagacag atcttgagct ggaccttctg gcacaagcct gtcatcctag 8220 ctattccagg aagctgaagc aggaagatag caaattcaag gccagcttaa gccacagatt 8280 gagttcaaga tcaacctgag caactttatg aaatcctatt ataacataaa aagtaggggt 8340 gggaggttag gctgtagctc agtggtagag tgattgccta gcacgcacaa gacccaggtt 8400 caattcccag tactgcaaaa aatatattag gaacccccta aaagcagtaa cattcacatt 8460 agatgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgttttg 8520 ttgggtattt atttcattta catttccaat gctatcccaa aagtccccca catcctcccc 8580 cacccaccac cttgtttttt tttttttttt tttttttttt tttgacctga aactcacagg 8640 ttaggttaga caagctgact ggtgagctcc aacttccaac gtaccatcat gcctggcttt 8700 tgttttggtg tctctgtgta accctggatg tcctggagct ctctctgtag accagcctgg 8760 ccttaaactc acagaaaccc acctgtttct gcctcccatg tgctgggatt aaaggcgtgt 8820 gccacctcac ccagccctgc tggacttaaa ttgggtcttc attttataag acaagcatga 8880 gctaattccc cagttcctaa aatgttttta acatccttaa acatcagaga ctgtctgtgg 8940 tattccctcc atgtgtcttc agtataccta ctcccctccc tgcctactgg gttcaacatg 9000 cccagtttgg gttctggctg cctgccccca ctcaagactc tcttttccat ctcaggacca 9060 cctggctagc cctcaccctg acactaattt tcctgctgct gatcagcact ggggtcaacg 9120 tgtccttgtt cctgggctcc agggccgaga ggaaccggca cctcgacggg gactatgtgt 9180 atcacccact gcaggaggtg aacggggaag cgctgactgc agagaaggag cacatggagg 9240 aaactagcaa ccccttcaag gactgaagag ctgccccaac ggcatgctcc agataatctt 9300 gtccctgctc ctcacttcca caggggacat tgtgaggcca ctggcatgga tgctatgcac 9360 cccacccttt gctggccata ttcctcctgt ccccatgctg tggctcatgc caacctagca 9420 ataaggagct ctggagagcc tgcacctgcc tcccgctcgc ctatatctgc tgcccagagg 9480 cctgtctcgc acaggggtct cgccactgcc aaagactccc aggaagtcaa agactcccag 9540 taatccacta gcaaatggaa ctctgtaacg ccatcataac aagagtggcc actctccgcg 9600 tgcacaggta tgaaatataa atccttacac acacacacac acacaccctc ggctcagcca 9660 cggcactcgc cttttataca gcgtcatcgc tggacagcca actagaactc tgcatcctgt 9720 cacaggaagc acctcataag aaggaatggg gagggaaggc agtcgccttg ttttcagacc 9780 ttagccgaat tc 9792 22 20 PRT Artificial Sequence synthetic peptide 22 Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 1 5 10 15 Gly Ser Thr Gly 20 23 13 PRT Artificial Sequence synthetic peptide 23 Asp Glu Asp Gln Val Asp Pro Arg Leu Ile Asp Gly Lys 1 5 10 24 2279 DNA Homo sapiens 24 agctaaggca ggtacctgca tccttgtttt tgtttagtgg atcctctatc cttcagagac 60 tctggaaccc ctgtggtctt ctcttcatct aatgaccctg aggggatgga gttttcaagt 120 ccttccagag aggaatgtcc caagcctttg agtagggtaa gcatcatggc tggcagcctc 180 acaggattgc ttctacttca ggcagtgtcg tgggcatcag gtgcccgccc ctgcatccct 240 aaaagcttcg gctacagctc ggtggtgtgt gtctgcaatg ccacatactg tgactccttt 300 gaccccccga cctttcctgc ccttggtacc ttcagccgct atgagagtac acgcagtggg 360 cgacggatgg agctgagtat ggggcccatc caggctaatc acacgggcac aggcctgcta 420 ctgaccctgc agccagaaca gaagttccag aaagtgaagg gatttggagg ggccatgaca 480 gatgctgctg ctctcaacat ccttgccctg tcaccccctg cccaaaattt gctacttaaa 540 tcgtacttct ctgaagaagg aatcggatat aacatcatcc gggtacccat ggccagctgt 600 gacttctcca tccgcaccta cacctatgca gacacccctg atgatttcca gttgcacaac 660 ttcagcctcc cagaggaaga taccaagctc aagatacccc tgattcaccg agccctgcag 720 ttggcccagc gtcccgtttc actccttgcc agcccctgga catcacccac ttggctcaag 780 accaatggag cggtgaatgg gaaggggtca ctcaagggac agcccggaga catctaccac 840 cagacctggg ccagatactt tgtgaagttc ctggatgcct atgctgagca caagttacag 900 ttctgggcag tgacagctga aaatgagcct tctgctgggc tgttgagtgg ataccccttc 960 cagtgcctgg gcttcacccc tgaacatcag cgagacttca ttgcccgtga cctaggtcct 1020 accctcgcca acagtactca ccacaatgtc cgcctactca tgctggatga ccaacgcttg 1080 ctgctgcccc actgggcaaa ggtggtactg acagacccag aagcagctaa atatgttcat 1140 ggcattgctg tacattggta cctggacttt ctggctccag ccaaagccac cctaggggag 1200 acacaccgcc tgttccccaa caccatgctc tttgcctcag aggcctgtgt gggctccaag 1260 ttctgggagc agagtgtgcg gctaggctcc tgggatcgag ggatgcagta cagccacagc 1320 atcatcacga acctcctgta ccatgtggtc ggctggaccg actggaacct tgccctgaac 1380 cccgaaggag gacccaattg ggtgcgtaac tttgtcgaca gtcccatcat tgtagacatc 1440 accaaggaca cgttttacaa acagcccatg ttctaccacc ttggccactt cagcaagttc 1500 attcctgagg gctcccagag agtggggctg gttgccagtc agaagaacga cctggacgca 1560 gtggcactga tgcatcccga tggctctgct gttgtggtcg tgctaaaccg ctcctctaag 1620 gatgtgcctc ttaccatcaa ggatcctgct gtgggcttcc tggagacaat ctcacctggc 1680 tactccattc acacctacct gtggcgtcgc cagtgatgga gcagatactc aaggaggcac 1740 tgggctcagc ctgggcatta aagggacaga gtcagctcac acgctgtctg tgactaaaga 1800 gggcacagca gggccagtgt gagcttacag cgacgtaagc ccaggggcaa tggtttgggt 1860 gactcacttt cccctctagg tggtgccagg ggctggaggc ccctagaaaa agatcagtaa 1920 gccccagtgt ccccccagcc cccatgctta tgtgaacatg cgctgtgtgc tgcttgcttt 1980 ggaaactggg cctgggtcca ggcctagggt gagctcactg tccgtacaaa cacaagatca 2040 gggctgaggg taaggaaaag aagagactag gaaagctggg cccaaaactg gagactgttt 2100 gtctttcctg gagatgcaga actgggcccg tggagcagca gtgtcagcat cagggcggaa 2160 gccttaaagc agcagcgggt gtgcccaggc acccagatga ttcctatggc accagccagg 2220 aaaaatggca gctcttaaag gagaaaatgt ttgagcccaa aaaaaaaaaa aaaaaaaaa 2279 25 536 PRT Homo sapiens 25 Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser 1 5 10 15 Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln 20 25 30 Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe 35 40 45 Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser 50 55 60 Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu 65 70 75 80 Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln 85 90 95 Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln 100 105 110 Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala 115 120 125 Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu 130 135 140 Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val 145 150 155 160 Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp 165 170 175 Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp 180 185 190 Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln 195 200 205 Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu 210 215 220 Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro 225 230 235 240 Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu 245 250 255 Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu 260 265 270 Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu 275 280 285 Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly 290 295 300 Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu 305 310 315 320 Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr 325 330 335 Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr 340 345 350 Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg 355 360 365 Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser 370 375 380 Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met 385 390 395 400 Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly 405 410 415 Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp 420 425 430 Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp 435 440 445 Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys 450 455 460 Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys 465 470 475 480 Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val 485 490 495 Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys 500 505 510 Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile 515 520 525 His Thr Tyr Leu Trp Arg Arg Gln 530 535 26 536 PRT Homo sapiens 26 Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser 1 5 10 15 Arg Val Ser Ile Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gln 20 25 30 Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys Ile Pro Lys Ser Phe 35 40 45 Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser 50 55 60 Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu 65 70 75 80 Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro Ile Gln 85 90 95 Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gln Pro Glu Gln 100 105 110 Lys Phe Gln Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala 115 120 125 Ala Leu Asn Ile Leu Ala Leu Ser Pro Pro Ala Gln Asn Leu Leu Leu 130 135 140 Lys Ser Tyr Phe Ser Glu Glu Gly Ile Gly Tyr Asn Ile Ile Arg Val 145 150 155 160 Pro Met Ala Ser Cys Asp Phe Ser Ile Arg Thr Tyr Thr Tyr Ala Asp 165 170 175 Thr Pro Asp Asp Phe Gln Leu His Asn Phe Ser Leu Pro Glu Glu Asp 180 185 190 Thr Lys Leu Lys Ile Pro Leu Ile His Arg Ala Leu Gln Leu Ala Gln 195 200 205 Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu 210 215 220 Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gln Pro 225 230 235 240 Gly Asp Ile Tyr His Gln Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu 245 250 255 Asp Ala Tyr Ala Glu His Lys Leu Gln Phe Trp Ala Val Thr Ala Glu 260 265 270 Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gln Cys Leu 275 280 285 Gly Phe Thr Pro Glu His Gln Arg Asp Phe Ile Ala Arg Asp Leu Gly 290 295 300 Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu 305 310 315 320 Asp Asp Gln Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr 325 330 335 Asp Pro Glu Ala Ala Lys Tyr Val His Gly Ile Ala Val His Trp Tyr 340 345 350 Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg 355 360 365 Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser 370 375 380 Lys Phe Trp Glu Gln Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met 385 390 395 400 Gln Tyr Ser His Ser Ile Ile Thr Asn Leu Leu Tyr His Val Val Gly 405 410 415 Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp 420 425 430 Val Arg Asn Phe Val Asp Ser Pro Ile Ile Val Asp Ile Thr Lys Asp 435 440 445 Thr Phe Tyr Lys Gln Pro Met Phe Tyr His Leu Gly His Phe Ser Lys 450 455 460 Phe Ile Pro Glu Gly Ser Gln Arg Val Gly Leu Val Ala Ser Gln Lys 465 470 475 480 Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val 485 490 495 Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr Ile Lys 500 505 510 Asp Pro Ala Val Gly Phe Leu Glu Thr Ile Ser Pro Gly Tyr Ser Ile 515 520 525 His Thr Tyr Leu Trp His Arg Gln 530 535 27 6 PRT Artificial Sequence synthetic peptide 27 Arg Ala Arg Tyr Lys Arg 1 5 

What is claimed:
 1. A method of preparing a highly phosphorylated acid β-glucocerebrosidase comprising: (a) contacting an acid β-glucocerebrosidase with an isolated GlcNAc phosphotransferase to produce a modified acid β-glucocerebrosidase; and (b) contacting said modified acid β-glucocerebrosidase with an isolated phosphodiester α-GlcNAcase.
 2. The method of claim 1, further comprising purifying said highly phosphorylated acid β-glucocerebrosidase after said contacting with the isolated phosphodiester α-GlcNAcase.
 3. The method of claim 1, further comprising purifying said modified acid β-glucocerebrosidase prior to said contacting with the isolated phosphodiester α-GlcNAcase.
 4. The method of claim 1, wherein said isolated GlcNAc phosphotransferase comprises an α subunit and β subunit.
 5. The method of claim 4, wherein the GlcNAc phosphotranferase comprises the amino acid of SEQ ID NO:2.
 6. The method of claim 1, wherein the GlcNAc phosphotranferase comprises SEQ ID NO:4 and SEQ ID NO:5.
 7. The method of claim 1, wherein the GlcNAc phosphotranferase is encoded by a nucleotide sequence comprising SEQ ID NO:1, or a sequence that hybridizes under stringent conditions to SEQ ID NO:1.
 8. The method of claim 1, The method of claim 1, wherein the GlcNAc phosphotranferase is encoded by a nucleotide sequence comprising SEQ ID NO:3, or a sequence that hybridizes under stringent conditions to SEQ ID NO:3.
 9. The method of claim 1, wherein the phosphodiester α-GlcNAcase comprises SEQ ID NO:17 or a sequence that hybridizes under stringent conditions to SEQ ID NO:17.
 10. The method of claim 1, wherein the acid β-glucocerebrosidase comprises the amino acid sequence of SEQ ID NO:25 or SEQ ID NO:26.
 11. The method of claim 1, wherein the acid-β-glucocerebrosidase comprises the amino acid sequence of SEQ ID NO:26.
 12. A highly phosphorylated acid β-glucocerebrosidase obtained by the method of claim
 1. 13. A pharmaceutical composition comprising the highly phosphorylated acid β-glucocerebrosidase of claim 12 and a pharmaceutically acceptable carrier.
 14. A method of treating a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 12 in an amount sufficient to treat said disease.
 15. The method of claim 14, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 16. A method of treating a bone tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 12 in an amount sufficient to treat said disease.
 17. The method of claim 16, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 18. A method of treating a lung tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 12 in an amount sufficient to treat said disease.
 19. The method of claim 18, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 20. A method of producing a highly phosphorylated acid β-glucocerebrosidase comprising: (a) culturing transfected cells comprising a recombinant polynucleotide which encodes a recombinant acid β-glucocerebrosidase in the presence of at least one α 1,2-mannosidase inhibitor; (b) recovering a high mannose recombinant acid β-glucocerebrosidase from said transfected cell; (c) contacting said high mannose recombinant acid β-glucocerebrosidase with an isolated GlcNAc phosphotransferase to produce a modified acid β-glucocerebrosidase; and (d) contacting said modified acid β-glucocerebrosidase with an isolated phosphodiester α-GlcNAcase.
 21. The method of claim 20, wherein said at least one 1,2-mannosidase inhibitor is selected from the group consisting of deoxymannojirimycin, kifunensine, D-Mannonolactam amidrazone, and N-butyl-deoxymannojirimycin.
 22. The method of claim 21, wherein the 1,2-mannosidase inhibitor is kifunensine.
 23. The method of claim 21, wherein the 1,2 mannosidase inhibitor is deoxymannojirimycin.
 24. The method of claim 21, wherein the at least one 1,2 mannosidase inhibitor is deoxymannojirimycin and kifunensine.
 25. The method of claim 20, further comprising purifying said modified acid β-glucocerebrosidase prior to said contacting with the isolated phosphodiester α-GlcNAcase.
 26. The method of claim 20, wherein said isolated GlcNAc phosphotransferase comprises an α subunit and β subunit.
 27. The method of claim 26, wherein the GlcNAc phosphotranferase comprises the amino acid SEQ ID NO:2.
 28. The method of claim 20, wherein the GlcNAc phosphotranferase comprises SEQ ID NO:4 and SEQ ID NO:5.
 29. The method of claim 20, wherein the GlcNAc phosphotranferase is encoded by a nucleotide sequence comprising SEQ ID NO:1, or a sequence that hybridizes under stringent conditions to SEQ ID NO:1.
 30. The method of claim 20, The method of claim 1, wherein the GlcNAc phosphotranferase is encoded by a nucleotide sequence comprising SEQ ID NO:3, or a sequence that hybridizes under stringent conditions to SEQ ID NO:3.
 31. The method of claim 20, wherein the phosphodiester α-GlcNAcase comprises SEQ ID NO:17 or a sequence that hybridizes under stringent conditions to SEQ ID NO:17.
 32. The method of claim 20, wherein the acid β-glucocerebrosidase comprises the amino acid sequence of SEQ ID NO:25 or SEQ ID NO:26.
 33. The method of claim 20, wherein the acid-β-glucocerebrosidase comprises the amino acid sequence of SEQ ID NO:26.
 34. A highly phosphorylated acid β-glucocerebrosidase obtained by the method of claim
 20. 35. A pharmaceutical composition comprising the highly phosphorylated acid β-glucocerebrosidase of claim 34 and a pharmaceutically acceptable carrier.
 36. A method of treating a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 34 in an amount sufficient to treat said disease.
 37. The method of claim 36, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 38. A method of treating a bone tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 34 in an amount sufficient to treat said disease.
 39. The method of claim 38, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 40. A method of treating a lung tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 34 in an amount sufficient to treat said disease.
 41. The method of claim 40, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 42. A highly phosphorylated acid β-glucocerebrosidase, which is encoded by the nucleotide sequence of SEQ ID NO:24 of a nucleotide sequence that hybridizes to the nucleotide sequence of SEQ ID NO:24.
 43. The highly phosphorylated acid β-glucocerebrosidase of claim 28, which comprises the amino acid sequence of SEQ ID NO:25 or SEQ ID NO:26.
 44. The highly phosphorylated acid β-glucocerebrosidase of claim 43, which comprises the amino acid sequence of SEQ ID NO:26.
 45. A pharmaceutical composition comprising the highly phosphorylated acid β-glucocerebrosidase of claim 42 and a pharmaceutically acceptable carrier.
 46. A method of treating a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 42 in an amount sufficient to treat said disease.
 47. The method of claim 46, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 48. A method of treating a bone tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 42 in an amount sufficient to treat said disease.
 49. The method of claim 48, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 50. A method of treating a lung tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 42 in an amount sufficient to treat said disease.
 51. The method of claim 50, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 52. A method of preparing a highly phosphorylated acid β-glucocerebrosidase comprising: (i) a step for transferring a N-acetylglucosamine-1-phosphate from UDP-GlcNAc to an acid β-glucocerebrosidase; (ii) a step for removing an N-acetylglucosamine from said acid β-glucocerebrosidase.
 53. A highly phosphorylated acid β-glucocerebrosidase obtained by the method of claim
 52. 54. The highly phosphorylated acid β-glucocerebrosidase of claim 53, which comprises the amino acid sequence of SEQ ID NO:26.
 55. A pharmaceutical composition comprising the highly phosphorylated acid β-glucocerebrosidase of claim 53 and a pharmaceutically acceptable carrier.
 56. A method of treating a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 53 in an amount sufficient to treat said disease.
 57. The method of claim 56, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 58. A method of treating a bone tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 53 in an amount sufficient to treat said disease.
 59. The method of claim 58, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated.
 60. A method of treating a lung tissue of a patient suffering from Gaucher's disease, comprising administering to the patient in need thereof the highly phosphorylated acid beta-glucocerebrosidase of claim 53 in an amount sufficient to treat said disease.
 61. The method of claim 60, further comprising administering acid β-glucocerebrosidase which is not highly phosphorylated. 